I discuss theories that describe fully nonlinear physics, while being practically linear (PL), in that they require solving only linear differential equations. These theories may be interesting in themselves as manageable nonlinear theories. But they can also be chosen to emulate genuinely nonlinear theories of special interest, for which they can serve as approximations. The idea can be applied to a large class of nonlinear theories, exemplified here with PL analogs of scalar theories, and of Born-Infeld (BI) electrodynamics. The general class of such PL theories of electromagnetism is governed by a Lagrangian $\mathcal{L} = -(1/2)F_{\mu\nu}Q^{\mu\nu} + \mathcal{S}(Q_{\mu\nu})$, where the electromagnetic field $F_{\mu\nu} = A_{\nu,\mu} - A_{\mu,\nu}$ couples to currents in the standard way, while $Q_{\mu\nu} = B_{\nu,\mu} - B_{\mu,\nu}$ is an auxiliary field that does not couple directly to currents. By picking a special form of $\mathcal{S}(Q_{\mu\nu})$, we can make such a theory similar in some regards to a given fully nonlinear theory, governed by the Lagrangian $-\mathcal{U}(F_{\mu\nu})$. For example, by "similar" we may imply that the theories are equivalent to second order in the expansion for weak fields, and that they are also equivalent for static configurations with one-dimensional symmetry (e.g., near point charges). A particularly felicitous choice, which implies the above similarities, is to take $\mathcal{S}$ as the Legendre transform of $\mathcal{U}$ in the variables $F_{\mu\nu}$. For the BI theory, this Legendre transform has the same form as the BI Lagrangian itself: $\mathcal{S}(Q_{\mu\nu}, E_0^2) = \mathcal{U}(Q_{\mu\nu}, -E_0^2)$ ($E_0$ is the limiting field of the BI theory). Various matter-of-principle questions remain to be answered regarding such theories. As a specific example, I discuss BI electrostatics in more detail. As an aside, for BI, I derive an exact expression for the short-distance force between two arbitrary point charges of the same sign, in any dimension.
I. INTRODUCTION



Nonlinear systems are rife in physics. My focus here is on theories for which the Lagrangian density is a nonquadratic function of the derivatives of some of the degrees of freedom (DoF).

Some examples of such scalar systems in Euclidean space are: (i) Electrostatics in nonlinear dielectrics, magnetostatics in the presence of superconductors, or nonlinear transport systems (e.g., nonlinear diffusion). (ii) Inviscid, irrotational, compressible, stationary flows. (iii) The problem of volume extremization. (iv) Alternative theories of gravity replacing the Poisson equation in the description of nonrelativistic gravity by a nonlinear version [1].

Other important examples of nonlinear physics involving vector fields are the different versions of nonlinear electrodynamics, such as that governed by the Heisenberg-Euler Lagrangian (see, e.g., [2]), and the Born-Infeld (BI) theory with its more recent generalizations. Born-Infeld theories have repeatedly come back into the limelight since their advent, almost eighty years ago, because of their unique properties (see, e.g., [3] [4]). In particular, they have attracted much attention in recent years, because they appear as effective theories in the context of string theory (e.g., [5]).

Such theories are notoriously unwieldy due to their nonlinearity. Here I discuss a class of theories that describe nonlinear physics while being practically linear (PL): they require solving only linear differential equations, with the nonlinearity entering only algebraically.

Even if such PL theories are not forced on us by nature, they may be useful as wieldy nonlinear (NL) theories that embody many of the attributes of genuinely nonlinear theories. Furthermore, for a given nonlinear theory of the type focused on here, we can find a kindred among the PL theories that mimics it in some regards, and could thus serve as a useful approximation.

The theories I discuss here came to light as a result of attempting to find approximations for the NL Poisson, modified-gravity theory alluded to in (iv) above. In [6] I described such a PL theory, called QUMOND; I showed that it can be considered a full-fledged theory in its own right (there even is a covariant relativistic MOND theory, for which QUMOND is the nonrelativistic limit [7]), and I showed that it may be so chosen as to approximate the nonlinear Poisson theory in various circumstances. I also discussed some of the differences between the two theories. This PL version of MOND has since been put to good use for predicting and calculating MOND effects in the solar system [8] [9], for calculating MOND fields of galaxies (e.g., [10]), and for studying structure formation in MOND [11]. Here, I essentially extend the concept to more general NL problems.

I first demonstrate the idea with theories for scalar fields, in section II. Section III deals with nonlinear electrodynamics, and PL analogs of Born-Infeld electrodynamics. In section IV, I describe a simple application to special-relativistic particle kinematics. Section V discusses BI electrostatics in more detail, as an example of an application. In section VI, I list some of the many aspects that remain to be checked and considered.



II. SCALAR THEORIES 



Consider a NL theory involving one real scalar field, $\phi$, that is governed by an action of the form

$$I = -\int dV\,\mathcal{U}(\phi_{,\mu}) + I_q, \qquad (1)$$

where $dV$ is the appropriate volume element. The action $I_q = -\int q\phi\,dV$ includes the interaction of $\phi$ with other DoF, and is assumed to depend linearly on $\phi$. The field equation for $\phi$ is then

$$\left(\frac{\partial\mathcal{U}}{\partial\phi_{,\mu}}\right)_{,\mu} = q, \qquad (2)$$



where the charge distribution, $q$, depends on the configuration of other DoF, but is independent of $\phi$. If the theory is rotationally invariant (or Lorentz invariant, or diffeomorphism invariant, depending on the background space) the action is of the form

$$I = -\int dV\,\mathcal{U}(g^{\mu\nu}\phi_{,\mu}\phi_{,\nu}) + I_q. \qquad (3)$$



Here, $g^{\mu\nu}$ is the (fixed) metric of the background space (assumed, for simplicity, to be flat Euclidean or Minkowski in what follows). The field equation for $\phi$ is then

$$2(\mathcal{U}'\phi^{,\mu})_{,\mu} = q(x^\mu). \qquad (4)$$

For the nonlinear systems mentioned in the introduction we have: (i) In nonlinear dielectrics, nonlinear transport systems, etc., $\mathcal{U}'$ is the response coefficient, which is a function of the field strength, and $q$ represents the density of sources. (ii) In ideal, irrotational, compressible, stationary flow problems, $\mathcal{U}'$ is the fluid density, which can be expressed as a function of the fluid velocity $\mathbf{v} = \nabla\phi$ through the Bernoulli equation; $q$ is the source density. For example, in a fluid with an equation of state of the form $p = a\varrho^\gamma$ ($a > 0$, $\gamma > 1$), we have $\mathcal{U}'(z) \propto [1 - (z/z_0)^2]^{1/(\gamma-1)}$, with $z_0^2 = 2c_0^2/(\gamma-1)$, where $c_0$ is the speed of sound at $z = 0$. (iii) In the problem of volume extremization of an $(N-1)$-dimensional manifold $x_1 = \phi(x_2, ..., x_N)$, embedded in an $N$-dimensional Euclidean space, we have $\mathcal{U}(z) \propto (1 + z^2)^{1/2}$, and $q(x_2, ..., x_N)$ may be understood as the density of external forces in the $x_1$ direction (as for a loaded soap film in a constant gravitational field). The above theories, as many others, tend to a linear theory in the limit of weak (gradient) fields: in these cases, $\mathcal{U}'(0)$ is a finite constant. (iv) In MOND gravity, which replaces the Poisson equation for the gravitational potential by a nonlinear version [1], we have $\mathcal{U}(z) \propto a_0^2 F(z/a_0^2)$ ($a_0$ is the acceleration constant of the theory). This theory is unique among those presented above in that it tends to the linear Poisson theory in the strong-field limit: $\mathcal{U}'(z \to \infty) \to$ const., while in the weak-field limit $\mathcal{U}'(z \ll 1) \propto z^{1/2}$, in order to reproduce galaxy dynamics without "dark matter".

A. The practically linear theory 

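Introduce the PL analog theory for a single scalar as follows: Start with the action

$$I = \int dV\,[-\phi_{,\mu}e^\mu + \mathcal{S}(\mathbf{e})] + I_q, \qquad (5)$$

where $\mathbf{e}$ is an auxiliary, vector DoF. As before, $I_q$ couples to $\phi$ (linearly), but not to $\mathbf{e}$. We then get the field equations

$$e^\mu{}_{,\mu} = q, \qquad \phi_{,\mu} = \frac{\partial\mathcal{S}}{\partial e^\mu}. \qquad (6)$$
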
If the Hessian of $\mathcal{S}$ is regular (needed for an acceptable theory, and assumed all along) the second set of equations may be inverted to give $\mathbf{e}(D\phi)$ ($D$ is the gradient). Furthermore, it can be shown that this inversion involves a single function $\mathcal{U}(D\phi)$ such that$^1$

$$e^\mu = \frac{\partial\mathcal{U}}{\partial\phi_{,\mu}}. \qquad (7)$$

$\mathcal{U}$ is such that its Hessian and that of $\mathcal{S}$ are mutual inverses. Substituting in the first of equations (6), we see that $\phi$ satisfies the field equation (2).

If we substitute in the Lagrangian in eq.(5) $D\phi$ from the second of eqs.(6), and express the resulting Lagrangian in terms of $D\phi$, we get, by definition, minus the Legendre transform of $\mathcal{S}(e^\mu)$. But this can be seen to equal $-\mathcal{U}(D\phi)$ up to a constant (because its derivative with respect to $\phi_{,\mu}$ is $e^\mu$). In other words, the theory (5), with $\mathbf{e}$ an independent DoF, is equivalent to the NL scalar theory (1) with $\mathcal{U}$ the Legendre transform of $\mathcal{S}$.

If, however, we do not permit $\mathbf{e}$ to be a general vector field, but constrain it, a priori, to be a gradient, $\mathbf{e} = D\psi$, with $\psi$ as the independent DoF, we get a theory of the PL type. It is governed by the action

$$I = \int dV\,[-D\phi\cdot D\psi + \mathcal{S}(D\psi)] + I_q \qquad (8)$$

(dot product with the appropriate metric, which also raises and lowers indices), and its field equations are

$$\Box\psi = q, \qquad \Box\phi = \left(\frac{\partial\mathcal{S}}{\partial\psi_{,\mu}}\right)_{,\mu} \equiv q_\phi. \qquad (9)$$

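In the rotationally invariant case we write the action and the field equations as:

$$I = \int dV\,\{-D\phi\cdot D\psi + \mathcal{S}[(D\psi)^2]\} + I_q, \qquad (10)$$

$$\Box\psi = q, \qquad \Box\phi = 2(\mathcal{S}'\psi^{,\mu})_{,\mu} \equiv q_\phi. \qquad (11)$$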

To solve these equations we first solve the linear equation for $\psi$, with the source distribution $q(x^\mu)$ and the appropriate boundary conditions (BC). We then substitute the solution in the expression for $q_\phi(x^\mu)$, which becomes a nonlinear, and nonlocal, functional of $q(x^\mu)$. Finally, we solve the linear equation for $\phi$, with $q_\phi$ as source. This involves solving only the linear (Poisson) equation twice, with an algebraic step in between. So the practical advantages of such theories are obvious.
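
The two-step procedure can be sketched numerically. What follows is a minimal illustration under stated assumptions, not the author's implementation: a one-dimensional Euclidean grid with $\phi = \psi = 0$ at both ends, a staggered finite-difference discretization, and a hypothetical BI-like response $2\mathcal{S}'(y) = (1 + y/E_0^2)^{-1/2}$. The names `solve_poisson_1d`, `pl_solve`, and `bi_like` are introduced here for illustration only.

```python
import math

def solve_poisson_1d(source, h):
    """Solve u'' = source at the interior nodes of a uniform grid, u = 0 at both ends.

    Thomas algorithm for the 3-point Laplacian; returns the full array
    including the two boundary zeros.
    """
    n = len(source)
    diag = [-2.0] * n                      # main diagonal; off-diagonals are all 1
    rhs = [s * h * h for s in source]
    for i in range(1, n):                  # forward elimination
        w = 1.0 / diag[i - 1]
        diag[i] -= w
        rhs[i] -= w * rhs[i - 1]
    u = [0.0] * n
    u[-1] = rhs[-1] / diag[-1]
    for i in range(n - 2, -1, -1):         # back substitution
        u[i] = (rhs[i] - u[i + 1]) / diag[i]
    return [0.0] + u + [0.0]

def pl_solve(q, h, two_s_prime):
    """Two linear solves with one algebraic step in between, following eqs.(11)."""
    psi = solve_poisson_1d(q, h)                                   # Box psi = q
    dpsi = [(psi[i + 1] - psi[i]) / h for i in range(len(psi) - 1)]  # D psi at midpoints
    flux = [two_s_prime(g * g) * g for g in dpsi]                  # 2 S'(y) D psi
    q_phi = [(flux[i + 1] - flux[i]) / h for i in range(len(flux) - 1)]
    phi = solve_poisson_1d(q_phi, h)                               # Box phi = q_phi
    return psi, phi, q_phi

E0 = 1.0                                    # assumed limiting field (arbitrary units)
bi_like = lambda y: 1.0 / math.sqrt(1.0 + y / E0**2)   # hypothetical 2 S'(y)

n = 199
h = 1.0 / (n + 1)
# Assumed source: a Gaussian charge blob centered in the interval.
q = [-50.0 * math.exp(-((i + 1) * h - 0.5) ** 2 / 0.01) for i in range(n)]
psi, phi, q_phi = pl_solve(q, h, bi_like)
```

With `two_s_prime = lambda y: 1.0` the scheme reduces to the linear theory and returns `phi` equal to `psi` up to roundoff, which is a convenient sanity check before switching on a nonlinear response.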

The pair of equations (6) also looks like two linear equations. But this is an illusion: one cannot, of course, solve the first for $e^\mu$, and then substitute in the second and solve for $\phi$ (for example, the first equation determines $e^\mu$ only up to a divergenceless field). It is only when we constrain $\mathbf{e}$ to be a gradient that the first equation does determine it (given the appropriate BC).

If we want to approximate a given NL theory governed by some $\mathcal{U}(D\phi)$ with a PL theory, then it would be a good choice to take $\mathcal{S}$ of the PL theory to be the Legendre transform of $\mathcal{U}$. This choice automatically guarantees certain similarities between the two theories. In the first place, it guarantees coincidence of the solutions in cases of 1-D symmetry$^2$, such as near point or line charges: in such cases all vector fields are gradients, and so the gradient constraint on $\mathbf{e}$, which was imposed to get the PL theory, is automatically satisfied; so the solution of the PL theory is automatically the solution of the NL theory (for the same BC).

Secondly, as I show below in section II D, such a choice of $\mathcal{S}$ guarantees that for weak fields, the solutions of the two theories coincide to the next order above the lowest, linear case.

Defining the Lagrangian of the PL theory is not enough to fix the theory. There remains the issue of what BC we dictate for $\psi$. While $\phi$ is the "physical" field that is felt by charges directly, so we usually know what BC we want for it, $\psi$ is auxiliary, and, in principle, we may have more freedom in choosing its BC. The procedure of picking BC for $\psi$ is part and parcel of the theory: Different choices of the BC of $\psi$ lead to different solutions for $\phi$ even for the same BC on $\phi$. Sometimes, however, the BC for $\psi$ as well are dictated by the problem. For example, in the problem of BI electrostatics, to be discussed in more detail below, or in the case of the NL, nonrelativistic MOND, Gauss's theorem, with the requirement that the solution becomes symmetric at infinity (spherically, cylindrically, etc.), applied to the first of eqs.(11) implies that $\psi \to 0$ at spatial infinity (e.g., $\nabla\psi \propto \mathbf{r}/r^3$ in the 3-D spherical case). More generally this may be a thorny issue that may incapacitate the method in some cases.

1 Define $q_\alpha \equiv \partial\mathcal{S}/\partial e^\alpha$; then, taking the partial derivative of this relation with respect to $q_\beta$ gives $\delta_\alpha^\beta = (\partial^2\mathcal{S}/\partial e^\alpha\partial e^\gamma)(\partial e^\gamma/\partial q_\beta)$. Multiplying by the inverse of the Hessian of $\mathcal{S}$, which is a symmetric matrix, we see that $\partial e^\alpha/\partial q_\beta$ is symmetric under the interchange of $\beta$ and $\alpha$. This means that there exists a function $\mathcal{U}(q)$ such that $e^\alpha = \partial\mathcal{U}/\partial q_\alpha$.

2 For example, spherically or cylindrically symmetric configurations, if we work in Euclidean space, or static, spherically symmetric configurations in Minkowski space-time.

The fully NL theory has a linear, weak-field limit if $\partial\mathcal{U}/\partial\phi_{,\mu}(0) = 0$ and $\partial^2\mathcal{U}/\partial\phi_{,\mu}\partial\phi_{,\nu}(0) = g^{\mu\nu}$, and similarly for the PL theory (for $|\phi_{,\mu}|, |\psi_{,\mu}| \ll 1$). In the weak-field limit, the action of the PL theory becomes

$$I \approx \int dV\,\left[-\frac{1}{2}(D\phi)^2 + \frac{1}{2}(D\chi)^2\right] + I_q, \qquad (12)$$

where $\chi = \phi - \psi$. We see that if we work in Minkowski space-time, in which case our metric convention is $\eta_{\mu\nu} = \mathrm{diag}(-1, 1, 1, 1)$, the kinetic term for $\phi$ has the "correct" sign, corresponding to positive kinetic energy, while $\chi$ has the ghost-like sign of the free Lagrangian; however, $\chi$ decouples from all other DoF in this limit. So the standard, linear theory is recovered.

The ghost-like nature of $\chi$ might bode trouble for the full theory, when time-dependent problems are considered. In stationary problems, such as all the examples above, this is not an issue. Even for fully dynamical situations it is not clear that this aspect is deleterious, since $\chi$ is not quite an independent degree of freedom that may be manipulated in itself. While $\chi$ waves may be carrying negative energy to infinity (where the linear approximation is good), it is not clear that there are charge configurations that emit net negative energies in toto, and thus become unacceptably unstable. This is, however, an important concern that remains to be addressed.

As was discussed in [6], the relation demonstrated here between the pair of theories is analogous to that between the standard and the Palatini formulations of gravitational, metric theories. The Palatini-like approach, whereby $\mathbf{e}$ is independent (i.e., not assumed to be a gradient) gives the NL theory. The "standard" approach, with $\mathbf{e}$ a gradient, yields the PL theory. For the linear case, both routes of variation give the same field equation, as the second of equations (6) becomes $\mathbf{e} = D\phi$.



B. Similarity in more detail 

1. Equivalence for one-dimensional configurations

Consider in some more detail the case of the rotationally invariant theory, so that $\mathcal{S} = \mathcal{S}(\mathbf{e}^2)$ and $\mathcal{U} = \mathcal{U}[(D\phi)^2]$. Consider a 1-D-symmetric configuration, such as a spherically symmetric one in flat space. If the problem at hand is posed in a Minkowskian space, assume that the configuration is also static, so the problem can be posed in the Euclidean space. Let the symmetry surfaces be designated by the coordinate $r$, so that $q = q(r)$. By applying Gauss's theorem to both theories for a volume within a constant $r$, we see that the gradient of the potential $\phi$ is a function of only $Q(r)$, the charge enclosed within $r$. The form of this function depends on $\mathcal{U}$ and $\mathcal{S}$, respectively, for the two theories. So, given $\mathcal{U}$ we can choose $\mathcal{S}$ such that $\phi$ will depend on $Q(r)$ in the same way in both theories. It is easily seen that the condition for this is as follows: If we define the variable $y$ such that$^3$

$$y^{1/2} = 2\mathcal{U}'(z)z^{1/2}, \qquad (13)$$

then $\mathcal{S}$ has to satisfy

$$z^{1/2} = 2\mathcal{S}'(y)y^{1/2}. \qquad (14)$$

Either equation has to define $y$ and $z$ as monotonic functions of one another.

It is easy to see that requirements (13) and (14) are tantamount to $\mathcal{S}$ and $\mathcal{U}$ being mutual Legendre transforms (in the components of $\mathbf{e}$ and $D\phi$ as variables, not in their squares), with the Hessians of $\mathcal{U}$ and $\mathcal{S}$ mutual inverses$^4$.

So in this case equivalence for 1-D configurations uniquely determines $\mathcal{S}$ to be the Legendre transform of $\mathcal{U}$. This is not so for more general cases, such as multi-scalar theories, or the BI theory (see below).
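
Conditions (13) and (14) can be checked on a concrete pair. The sketch below assumes a hypothetical BI-like scalar pair, $2\mathcal{U}'(z) = (1 - z/E_0^2)^{-1/2}$ and $2\mathcal{S}'(y) = (1 + y/E_0^2)^{-1/2}$, chosen here only for illustration; it verifies that the two conditions are mutually inverse maps, and that $\mathcal{S}$ agrees with a brute-force Legendre transform of $\mathcal{U}$ taken in the field component, not in its square.

```python
import math

E0 = 1.0  # assumed limiting field (arbitrary units)

def U(z):            # hypothetical U, with z = |D phi|^2 < E0^2
    return E0**2 * (1.0 - math.sqrt(1.0 - z / E0**2))

def S(y):            # candidate Legendre partner, with y = |D psi|^2
    return E0**2 * (math.sqrt(1.0 + y / E0**2) - 1.0)

def two_U_prime(z):  # 2 dU/dz
    return 1.0 / math.sqrt(1.0 - z / E0**2)

def two_S_prime(y):  # 2 dS/dy
    return 1.0 / math.sqrt(1.0 + y / E0**2)

def y_of_z(z):       # eq.(13): y^(1/2) = 2 U'(z) z^(1/2)
    return (two_U_prime(z) * math.sqrt(z)) ** 2

def z_of_y(y):       # eq.(14): z^(1/2) = 2 S'(y) y^(1/2)
    return (two_S_prime(y) * math.sqrt(y)) ** 2

def legendre_of_U(e, samples=20001):
    """Brute-force sup over 0 <= x < E0 of [e*x - U(x^2)], x being a field component."""
    best = -math.inf
    for k in range(samples):
        x = E0 * k / samples
        best = max(best, e * x - U(x * x))
    return best

for z in (0.01, 0.3, 0.9):
    y = y_of_z(z)
    print(z, z_of_y(y), two_U_prime(z) * two_S_prime(y))
```

For this pair the round trip $z \to y \to z$ closes, and the product $4\mathcal{U}'(z)\mathcal{S}'(y)$ equals one for corresponding arguments.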



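3 $z$ stands for $|D\phi|^2$, and $y$ stands for $|D\psi|^2$.

4 The product of the Hessians is $4\mathcal{S}'\mathcal{U}'\delta_{\mu\nu} + e_\mu e_\nu(32\mathcal{S}'^3\mathcal{U}'' + 8\mathcal{S}''\mathcal{U}' + 64\mathcal{S}''\mathcal{U}''\mathcal{S}'^2\mathbf{e}^2)$, and is seen to give $\delta_{\mu\nu}$, from eqs.(13)(14).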
2. Equivalence to second order in weak fields

Suppose the nonlinear theory has a linear weak-field limit ($|D\phi| \ll 1$); so we have $\mathcal{U}'(0) = 1/2$. The PL theory has this limit if $\mathcal{S}'(0) = 1/2$. In this limit $\Box\phi = \Box\psi = q$, so $\phi = \psi = \bar\phi$ (where a bar designates the solution of the linear theory for the same charge distribution and BC). Writing $\phi = \bar\phi + \eta$, we have to lowest order in $\eta$

$$\Box\eta = -2\mathcal{U}''(0)[(D\bar\phi)^2\bar\phi^{,\mu}]_{,\mu} \qquad (15)$$

in the NL theory, and in the PL theory we have ($\psi = \bar\phi$)

$$\Box\eta = 2\mathcal{S}''(0)[(D\bar\phi)^2\bar\phi^{,\mu}]_{,\mu}. \qquad (16)$$

The two are the same if $\mathcal{S}''(0) = -\mathcal{U}''(0)$. If $\mathcal{U}''(0) = 0$, there is a similar condition on the first derivative that does not vanish at zero (see below in II D).

Condition (14) guarantees this: it implies that $4\mathcal{U}'(z)\mathcal{S}'(y) = 1$ for $y$ and $z$ related by eq.(13). Taking the $y$ derivative of this at $y = z = 0$ we get $\mathcal{S}''(0) = -\mathcal{U}''(0)$. So the equivalence of the theories for 1-D configurations implies equivalence to second order for all configurations.
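
A quick numerical cross-check of $\mathcal{S}''(0) = -\mathcal{U}''(0)$ is sketched here by central differences, assuming the same hypothetical BI-like pair $\mathcal{U}'(z) = \frac{1}{2}(1 - z/E_0^2)^{-1/2}$, $\mathcal{S}'(y) = \frac{1}{2}(1 + y/E_0^2)^{-1/2}$. This pair is an illustration, not a form taken from the text.

```python
import math

E0 = 1.0  # assumed limiting field (arbitrary units)

def U_prime(z):   # hypothetical U'(z); U'(0) = 1/2 gives the linear weak-field limit
    return 0.5 / math.sqrt(1.0 - z / E0**2)

def S_prime(y):   # hypothetical S'(y); S'(0) = 1/2
    return 0.5 / math.sqrt(1.0 + y / E0**2)

def second_derivative_at_zero(first_derivative, step=1e-5):
    """Central-difference estimate of the derivative of `first_derivative` at 0."""
    return (first_derivative(step) - first_derivative(-step)) / (2.0 * step)

U2 = second_derivative_at_zero(U_prime)
S2 = second_derivative_at_zero(S_prime)
print(U2, S2)     # the two values should be equal in magnitude, opposite in sign
```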



C. Weak perturbations in the two theories 

Suppose the solutions of the field equations of the two theories, $\hat\phi$, $\hat\psi$, are known for a charge distribution $\hat q$. Expanding about this solution to first order in a small perturbation $\eta = \phi - \hat\phi$, $\lambda = \psi - \hat\psi$, caused by a small change in the density $\epsilon = q - \hat q$, we have for the NL theory

$$[\mathcal{H}_U^{\mu\nu}(D\hat\phi)\,\eta_{,\nu}]_{,\mu} = \epsilon, \qquad (17)$$

and for the kindred PL theory

$$\Box\lambda = \epsilon, \qquad \Box\eta = [\mathcal{H}_S^{\mu\nu}(D\hat\psi)\,\lambda_{,\nu}]_{,\mu}, \qquad (18)$$

where $\mathcal{H}_U$ and $\mathcal{H}_S$ are the Hessians of $\mathcal{U}$ and $\mathcal{S}$ respectively. These two sets of field equations are, generally, not strongly related. However, as we saw, if the unperturbed problem is of 1-D symmetry, $\mathcal{H}_U$ and $\mathcal{H}_S$ are mutual inverses everywhere; so the two perturbation problems become related (but not the same): The NL theory gives an analog of scalar, linear electrodynamics in a position-dependent, anisotropic dielectric, which we can write, with $A \equiv \mathcal{H}_U$, as

$$[A^{\mu\nu}(x)\,\eta_{,\nu}]_{,\mu} = \Box\lambda, \qquad (19)$$

which is still difficult to solve, generally. The PL theory gives the Poisson equation

$$\Box\eta = [(A^{-1})^{\mu\nu}(x)\,\lambda_{,\nu}]_{,\mu}, \qquad (20)$$

where $\Box\lambda = \epsilon$.

In some instances we deal with a charge system $\epsilon$ of small extent, embedded in a meta-system, whose effect on the subsystem may be approximated by a constant external field. This external field is then not part of the dynamics, but is dictated as BC: We seek to solve the NL or PL problem for $\epsilon$, where in the former we dictate the BC of constant $D\phi = \mathbf{g}_0$ at infinity, while in the latter we have to dictate both $D\phi = \mathbf{g}_0$ and $D\psi = \mathbf{f}_0$ at infinity. In this latter case $\mathbf{g}_0$ and $\mathbf{f}_0$ are not a priori related without specifying what the meta-system is like, and where in it the subsystem is. For example, if the meta-system has 1-D symmetry, $\mathbf{g}_0$ and $\mathbf{f}_0$ are parallel and their magnitudes are related. If, in addition, the subsystem can be treated as a small perturbation, we have eqs.(19)(20), with $A^{\mu\nu}$ now a constant matrix. For example, when $\mathcal{U}(D\phi) = \mathcal{U}[(D\phi)^2]$ (and similarly for $\mathcal{S}$), we have

$$A = \mathcal{H}_U = 2\mathcal{U}'_0(I + 2\hat{\mathcal{U}}_0\,\mathbf{e}_0\otimes\mathbf{e}_0), \qquad (21)$$

where $I$ is the unit matrix, $\mathcal{U}'_0$ and $\hat{\mathcal{U}}_0$ are the values of $\mathcal{U}'$ and of its logarithmic derivative, $d\ln\mathcal{U}'/d\ln z$, calculated at $\mathbf{g}_0$, and $\mathbf{e}_0$ is a unit vector in the direction of $\mathbf{g}_0$. Taking, say, the 1 axis in the direction of $\mathbf{e}_0$, and defining $\tilde\lambda = \lambda/2\mathcal{U}'_0$, we can write eq.(19) as

$$\Box\eta + 2\hat{\mathcal{U}}_0\,\eta_{,1,1} = \Box\tilde\lambda, \qquad (22)$$

and eq.(20) as

$$\Box\eta = [(1 + 2\hat{\mathcal{U}}_0)^{-1} - 1]\tilde\lambda_{,1,1} + \Box\tilde\lambda, \qquad (23)$$

with $\Box\tilde\lambda = \epsilon/2\mathcal{U}'_0$. In the coordinates $\tilde x^1 = (1 + 2\hat{\mathcal{U}}_0)^{-1/2}x^1$, $\tilde x^i = x^i$ for $i > 1$, eq.(22) takes the same form as eq.(23), and both theories then involve solving a linear Poisson equation.
D. Multi-scalar theories



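Consider a NL theory of many scalar fields, with the Lagrangian:

$$\mathcal{L} = -\mathcal{U}(D\phi^1, ..., D\phi^N) - \sum_{a=1}^{N}\phi^a q^a, \qquad (24)$$

leading to the field equations

$$\left(\frac{\partial\mathcal{U}}{\partial\phi^a_{,\mu}}\right)_{,\mu} = q^a. \qquad (25)$$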
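As in the single-scalar case, the Lagrangian

$$\mathcal{L} = -\sum_a D\phi^a\cdot\mathbf{e}^a + \mathcal{S}(\mathbf{e}^1, ..., \mathbf{e}^N) - \sum_{a=1}^{N}\phi^a q^a, \qquad (26)$$

with $\mathcal{S}$ the Legendre transform of $\mathcal{U}$ (with respect to all variables), gives the field equations (25) for $\phi^a$, and is an equivalent theory. If, however, we constrain $\mathbf{e}^a$ to be gradient fields, $\mathbf{e}^a = D\psi^a$, with $\psi^a$ the fundamental DoF, we get a different theory: the PL theory, whose field equations are:

$$\Box\psi^a = q^a, \qquad \Box\phi^a = \left(\frac{\partial\mathcal{S}}{\partial\psi^a_{,\mu}}\right)_{,\mu}. \qquad (27)$$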

For 1-D configurations, all the $\mathbf{e}^a$ are automatically (parallel) gradients; so the constraint leading to eqs.(27) is anyhow satisfied even in the NL theory; so the solutions of the two theories coincide. In this context, there is a difference between the single- and multi-scalar theories: In the single-scalar, rotationally invariant theory (3), $\mathcal{U}$ and $\mathcal{S}$ are functions of only one variable, as they depend on $D\phi$, or $D\psi$, through $(D\phi)^2$, or $(D\psi)^2$, respectively. Then, the requirement of coincidence for 1-D configurations is enough to pinpoint $\mathcal{S}$, uniquely, as the Legendre transform of $\mathcal{U}$. For a multi-scalar theory, this is not the case, even for rotationally invariant theories: Now, $\mathcal{U}$ and $\mathcal{S}$ depend on their vector variables through the invariants $D\phi^a\cdot D\phi^b$, or $D\psi^a\cdot D\psi^b$. But for 1-D configurations, only a subset of the variable values is probed, since each of the $D\phi^a$ and $D\psi^a$ is determined by only one component, and $\mathcal{S}$ and $\mathcal{U}$ become functions of only these $N$ single components. So, clearly, only the dependence of $\mathcal{S}$ on a subset of its variables enters, and is constrained, by the requirement of 1-D equivalence. This, as we shall see, is the case for NL electromagnetism as well. The Legendre-transform choice is thus not unique. But it might have additional, yet unappreciated advantages.

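Another attraction of taking $\mathcal{S}$ to be the Legendre transform of $\mathcal{U}$ is that it gives a PL theory that coincides, to next-to-leading order in weak fields, with the NL $\mathcal{U}$ theory: Suppose the NL theory has the standard linear theory as its weak-field limit. Expand in the field equation (25) around zero. Let $n > 2$ be the lowest order, beyond the Hessian, for which not all the derivatives of $\mathcal{U}$ vanish at zero$^5$. Write $\phi^a = \bar\phi^a + \eta^a$, where $\bar\phi^a$ is the solution of the linear problem (with the same BC), and expand up to the first nonvanishing order $n$:

$$\left\{\frac{\partial\mathcal{U}}{\partial\phi^a_{,\mu}}(0) + \frac{\partial^2\mathcal{U}}{\partial\phi^a_{,\mu}\partial\phi^b_{,\nu}}(0)(\bar\phi^b_{,\nu} + \eta^b_{,\nu}) + \frac{1}{(n-1)!}\frac{\partial^n\mathcal{U}}{\partial\phi^a_{,\mu}\partial\phi^{a_1}_{,\mu_1}\cdots\partial\phi^{a_{n-1}}_{,\mu_{n-1}}}(0)\left[(\bar\phi^{a_1}_{,\mu_1} + \eta^{a_1}_{,\mu_1})\cdots(\bar\phi^{a_{n-1}}_{,\mu_{n-1}} + \eta^{a_{n-1}}_{,\mu_{n-1}})\right]\right\}_{,\mu} = q^a, \qquad (28)$$

with repeated $a$ indices summed over. For the theory to have the linear limit, $\Box\phi^a = q^a$, for weak fields, as is assumed, it follows that $\partial\mathcal{U}/\partial\phi^a_{,\mu}(0) = 0$, and that $\partial^2\mathcal{U}/\partial\phi^a_{,\mu}\partial\phi^b_{,\nu}(0) = \delta^{ab}g^{\mu\nu}$. Thus, the next order correction $\eta^a$ is gotten as the solution of

$$\Box\eta^a = -\frac{1}{(n-1)!}\frac{\partial^n\mathcal{U}}{\partial\phi^a_{,\mu}\partial\phi^{a_1}_{,\mu_1}\cdots\partial\phi^{a_{n-1}}_{,\mu_{n-1}}}(0)\,(\bar\phi^{a_1}_{,\mu_1}\cdots\bar\phi^{a_{n-1}}_{,\mu_{n-1}})_{,\mu}. \qquad (29)$$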



5 For example, if we require space-time reflection invariance, $\mathcal{U}$ is even in $D\phi^a$; so $n \ge 4$; $n = 4$ would be generic.
In the corresponding PL theory, the solutions of the linear theory are the same. The next order correction is gotten in a similar way:

$$\Box\eta^a = \frac{1}{(n-1)!}\frac{\partial^n\mathcal{S}}{\partial\psi^a_{,\mu}\partial\psi^{a_1}_{,\mu_1}\cdots\partial\psi^{a_{n-1}}_{,\mu_{n-1}}}(0)\,(\bar\phi^{a_1}_{,\mu_1}\cdots\bar\phi^{a_{n-1}}_{,\mu_{n-1}})_{,\mu}. \qquad (30)$$

So, equivalence of the theories to this order follows if all the $n$th derivatives of $\mathcal{S}$ and $\mathcal{U}$ at zero argument are equal in magnitude and opposite in sign.

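Since $\mathcal{U}(x_a)$ and $\mathcal{S}(y_a)$ are mutual Legendre transforms (I write, for brevity, $x_a$ for the variables $\phi^a_{,\mu}$ of $\mathcal{U}$, and $y_a$ for the variables $\psi^a_{,\mu}$ of $\mathcal{S}$) their Hessians are the inverses of each other (at all values of the argument):

$$\frac{\partial^2\mathcal{U}}{\partial x_a\partial x_b}\frac{\partial^2\mathcal{S}}{\partial y_b\partial y_c} = \delta_{ac}. \qquad (31)$$

So also $\partial^2\mathcal{S}/\partial y_a\partial y_b(0) = \delta_{ab}$ [and $\partial\mathcal{S}/\partial y_a(0) = 0$]. Taking successive derivatives of eq.(31) with respect to the $x_a$'s at zero argument (and noting that $\partial y_a/\partial x_b(0) = \delta_{ba}$), we find, first, that all the derivatives of $\mathcal{S}$ of order between 3 and $n - 1$ also vanish at zero arguments, and that

$$\frac{\partial^n\mathcal{S}}{\partial y_{a_1}\cdots\partial y_{a_n}}(0) = -\frac{\partial^n\mathcal{U}}{\partial x_{a_1}\cdots\partial x_{a_n}}(0), \qquad (32)$$

thus confirming that the two theories give the same $\eta^a$. {Compare with the condition below eq.(16), where it is assumed tacitly that $n = 4$: when $\mathcal{U} = \mathcal{U}[(D\phi)^2]$, the fourth derivative of $\mathcal{U}$ with respect to components of $D\phi$, at zero, is proportional to $\mathcal{U}''(0)$.}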



E. Phantom charges 

For theories that have a linear limit (either for weak fields, as in the systems (i-iii) mentioned in the Introduction, or for strong fields, as in MOND) it is useful to introduce the notion of the "phantom" charge density, $q_p(x^\mu)$ (or "phantom" current density in the electromagnetic case) (PC). The PC is the charge distribution we have to add to $q$ to make $\phi$ a solution of the linear equation with the same BC. In other words,

$$q_p = \Box\phi - q. \qquad (33)$$

If the solution of our theory is unique for given $q(x^\mu)$ and BCs, knowledge of $q_p(x^\mu)$ is equivalent to knowledge of $\phi$.

This concept is useful because it may help us bring our experience with the linear problem to bear on the nonlinear problem, if we have some knowledge of properties of the PC.

For a genuinely NL theory, we cannot know $q_p$ before we solve the full problem. But in PL theories of type (10), $q_p$ is known once we have the solution of the linear theory:

$$q_p = 2(\mathcal{S}'\psi^{,\mu})_{,\mu} - q = q_\phi - q. \qquad (34)$$

In modified gravity theories such as nonrelativistic MOND, the phantom charge (mass, in this case) represents what we would interpret as "dark matter" if we insist that the Poisson equation governs the gravitational potential $\phi$, when, in fact, it is the NL theory that does. Much use of the PC has been made in this context (see, e.g., [12] for a review).

Note, importantly, that, unlike $q$, the PC is not an independent quantity that can be dictated at will: given the BCs it is fully determined by $q(x^\mu)$.

As an example of some preknowledge of properties of the PC, consider a static problem in Euclidean space, with a linear, weak-field limit [e.g., of the three types (i-iii) mentioned above] and assume that $q(\mathbf{x})$ is bounded and has a finite total charge. Applying Gauss's theorem to the field equation (4) we see that the weak limit is approached at spatial infinity. Thus, $\mathcal{U}' \to 1/2$ in this limit, and so it is seen that the total phantom charge vanishes: $\int q_p(\mathbf{x})d^3x = 0$. This is clearly true also for the PL theory (10), where, again, applying a Gauss integration over the whole volume gives $\int q_\phi(\mathbf{x})d^3x = \int q(\mathbf{x})d^3x$; so that $\int q_p(\mathbf{x})d^3x = 0$. This is not the case in a theory like MOND, where the linear theory is approached in the strong-field limit, not in the weak-field one. Here, for an isolated mass, the total phantom mass diverges, since the phantom density decreases as $1/r^2$ at infinity.
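
The vanishing of the total phantom charge in a PL theory with a weak-field linear limit can be illustrated for an isolated point charge. The sketch below assumes 3-D Euclidean space, spherical symmetry, and the hypothetical BI-like response $2\mathcal{S}'(y) = (1 + y/E_0^2)^{-1/2}$; the function name is introduced here for illustration.

```python
import math

E0 = 1.0  # assumed limiting field (arbitrary units)

def two_S_prime(y):
    """Hypothetical BI-like response 2 S'(y); tends to 1 for weak fields."""
    return 1.0 / math.sqrt(1.0 + y / E0**2)

def enclosed_phantom_charge(Q, r):
    """Phantom charge inside radius r around a point charge Q.

    Gauss's theorem on Box psi = q gives |D psi| = Q / (4 pi r^2); the PL field
    is D phi = 2 S'(y) D psi, and Gauss's theorem on q_p = Box phi - q gives the
    enclosed phantom charge as 4 pi r^2 (D phi - D psi).
    """
    dpsi = Q / (4.0 * math.pi * r * r)
    dphi = two_S_prime(dpsi * dpsi) * dpsi
    return 4.0 * math.pi * r * r * (dphi - dpsi)

for r in (1e-3, 1.0, 10.0, 1e3):
    print(r, enclosed_phantom_charge(1.0, r))
```

In this example the enclosed phantom charge approaches $-Q$ at small radii, where the field saturates, and tends to zero at large radii, in line with the vanishing total.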
III. ELECTROMAGNETIC VECTOR THEORIES

The standard Maxwell Lagrangian is:

$$\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \mathcal{L}_M(A_\mu, ...), \qquad (35)$$

where the basic DoF are the components of the vector potential $A_\mu$, such that $F_{\mu\nu} = A_{\nu,\mu} - A_{\mu,\nu}$, and the dependence of $\mathcal{L}_M$ on the other DoF is suppressed. The resulting field equations are

$$F^{\nu\mu}{}_{,\mu} = J^\nu \qquad (36)$$

(plus the identities, the homogeneous Maxwell equations: $F_{[\mu\nu,\lambda]} = 0$), where the current $J^\nu$ is, as usual, such that $\delta I_M = \int J^\nu\delta A_\nu$.

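In more general, nonlinear electrodynamics, such as Born-Infeld (BI), we have a Lagrangian of the form

$$\mathcal{L} = -\mathcal{U}(F_{\mu\nu}) + \mathcal{L}_M(A_\mu, ...). \qquad (37)$$

The resulting field equations for $A_\mu$ are

$$2\left(\frac{\partial\mathcal{U}}{\partial F_{\mu\nu}}\right)_{,\nu} = J^\mu. \qquad (38)$$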



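Lorentz invariance dictates that the Lagrangian depends on $F_{\mu\nu}$ through its invariants. In four dimensions (which I assume all along for concreteness) these are the two invariants

$$p_F = \frac{1}{4}F_{\mu\nu}F^{\mu\nu} = \frac{1}{2}(B^2 - E^2), \qquad q_F = \frac{1}{8}\epsilon^{\alpha\beta\mu\nu}F_{\alpha\beta}F_{\mu\nu} = -\mathbf{E}\cdot\mathbf{B}, \qquad (39)$$

where $\mathbf{E}$ and $\mathbf{B}$ are the electric and magnetic fields$^6$. So write

$$\mathcal{L} = -\mathcal{U}(p_F, q_F) + \mathcal{L}_M(A_\mu, ...). \qquad (40)$$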

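For example, in the standard BI theory

$$\mathcal{U} = E_0^2\left[-\det(\eta_{\mu\nu} + F_{\mu\nu}/E_0)\right]^{1/2}, \qquad (41)$$

up to a constant, which is immaterial here, as I assume flat space-time. This can be written in four dimensions as

$$\mathcal{U} = E_0^2\left[1 - \frac{E^2 - B^2}{E_0^2} - \frac{(\mathbf{E}\cdot\mathbf{B})^2}{E_0^4}\right]^{1/2}. \qquad (42)$$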

In terms of $\mathcal{U}(p_F, q_F)$ we can write the field equations as

$$(\mathcal{U}_p F^{\mu\nu} + \mathcal{U}_q\,{}^*F^{\mu\nu})_{,\nu} = J^\mu, \qquad (43)$$

where $\mathcal{U}_p$ and $\mathcal{U}_q$ are the partial derivatives of $\mathcal{U}$, and ${}^*F^{\mu\nu} = (1/2)\epsilon^{\mu\nu\alpha\beta}F_{\alpha\beta}$. (The homogeneous identities can be written as ${}^*F^{\mu\nu}{}_{,\nu} = 0$.)

For the theory to tend to Maxwell's in the limit of weak fields we need $\mathcal{U}_p^0 \equiv \mathcal{U}_p(0, 0) = 1$, and $\mathcal{U}_q^0 \equiv \mathcal{U}_q(0, 0) = 0$. We see that for the BI theory we have, in addition, $\mathcal{U}_q(p, 0) = 0$ for all $p$. This is more generally the case in theories with time-reversal invariance, since $q$ changes sign under time reversal. I will assume this in what follows.



6 I work with a $(-1, 1, 1, 1)$ signature, and $c = 1$; $\epsilon_{\alpha\beta\mu\nu}$ is the totally antisymmetric tensor, with $\epsilon_{0123} = 1$; so, $\epsilon^{0123} = -1$.
A. Practically linear kindred



Now, consider a class of PL electrodynamic theories that involve, in addition to $A_\mu$, an auxiliary vector field $B_\mu$, with the corresponding $Q_{\mu\nu} = B_{\nu,\mu} - B_{\mu,\nu}$, and having the Lagrangian

$$\mathcal{L} = -\frac{1}{2}F_{\mu\nu}Q^{\mu\nu} + \mathcal{S}(Q_{\mu\nu}) + \mathcal{L}_M(A_\mu, ...), \qquad (44)$$

where $\mathcal{L}_M$ is the standard matter Lagrangian; it depends in the standard way on $A_\mu$ (but not on $B_\mu$), and on the other matter DoF. Variation over $A_\mu$ now gives

$$Q^{\mu\nu}{}_{,\nu} = J^\mu. \qquad (45)$$

Namely, $B_\mu$ is a Maxwellian EM vector field for the given current distribution. Variation over $B_\mu$ gives

$$F^{\mu\nu}{}_{,\nu} = \hat J^\mu \equiv 2\left(\frac{\partial\mathcal{S}}{\partial Q_{\mu\nu}}\right)_{,\nu}. \qquad (46)$$

So the EM field $A_\mu$ is also a Maxwell field, but for the current $\hat J^\mu$. This current is identically conserved, because $P^{\mu\nu} \equiv 2\partial\mathcal{S}/\partial Q_{\mu\nu}$ is antisymmetric in the indices. $P^{\mu\nu}$ is an algebraic expression of $Q_{\alpha\beta}$, the Maxwellian field of the problem. The theory has a double gauge invariance$^7$.

In four dimensions it is convenient to write $\mathcal{S}$ as a function of the invariants:

$$\mathcal{L} = -\frac{1}{2}F_{\mu\nu}Q^{\mu\nu} + \mathcal{S}(p_Q, q_Q) + \mathcal{L}_M(A_\mu, ...). \qquad (49)$$

The field equations (46) are then written as

$$F^{\mu\nu}{}_{,\nu} = (\mathcal{S}_p Q^{\mu\nu} + \mathcal{S}_q\,{}^*Q^{\mu\nu})_{,\nu}. \qquad (50)$$

Here, the right-hand side is given once the solution of the Maxwell equations of the problem is known. It is sometimes useful to write the theory in terms of $H_{\mu\nu} = F_{\mu\nu} - Q_{\mu\nu}$ instead of $F_{\mu\nu}$:

$$H^{\mu\nu}{}_{,\nu} = [(\mathcal{S}_p - 1)Q^{\mu\nu} + \mathcal{S}_q\,{}^*Q^{\mu\nu}]_{,\nu}. \qquad (51)$$

To ensure the Maxwellian weak-field limit we need $\mathcal{S}_p^0 \equiv \mathcal{S}_p(0, 0) = 1$, and $\mathcal{S}_q^0 \equiv \mathcal{S}_q(0, 0) = 0$. We can then write, to lowest order in weak fields,

$$\mathcal{L} = -\frac{1}{4}(F_{\mu\nu}F^{\mu\nu} - H_{\mu\nu}H^{\mu\nu}) + \mathcal{L}_M(A_\mu, ...). \qquad (52)$$

We see that, again, $A_\mu$ has the standard Maxwell action and couples to currents in the standard way, while the difference $A_\mu - B_\mu$ (the potential of $H_{\mu\nu}$) decouples altogether. Note, however, that, as in the scalar case, this latter, auxiliary field has the "wrong" (ghost-like) sign of its kinetic action.




B. Similarity conditions 

What choices of $\mathcal{S}$ give a PL theory that is equivalent to the NL theory for 1-D configurations, and equivalent to it to second order in the field gradients for any configuration?



7 We can write the result above in terms of the Hodge decomposition of the 2-form $P = P_{\mu\nu}dx^\mu\wedge dx^\nu$: If we decompose (the decomposition is unique when appropriate BC are imposed, effectively compactifying the underlying space)

$$P = dA + \delta B^{(3)} + h, \qquad (47)$$

where $A = A_\mu dx^\mu$ is a vector potential, $B^{(3)}$ is some 3-form, and $h$ is harmonic (a vacuum solution), then

$$F = F_{\mu\nu}dx^\mu\wedge dx^\nu = dA + h. \qquad (48)$$

It satisfies $dF = 0$, the homogeneous equation (the Bianchi identities), since harmonic forms are closed (and co-closed). It clearly also satisfies eq.(46), which can be written as $\delta(F - P) = 0$.
1. Equivalence for static 1-D problems

For a static, 1-D configuration, currents depend only on one space coordinate, $x^m$, in an orthogonal coordinate system (e.g., a time-independent, spherically symmetric charge distribution, or a current density distribution in an infinite cylindrical wire).

Staticity and current conservation imply $\nabla\cdot\mathbf{J} = 0$. Also, $\nabla\cdot\mathbf{B} = 0$. Applying Gauss's theorem to surfaces of constant $x^m$ shows that $J^m = 0$, $B^m = 0$. These apply in both theories, to both $J^\mu$ and $\hat J^\mu$, and to all the magnetic fields (i.e., those in both $Q_{\mu\nu}$ and $F_{\mu\nu}$). Also, the electric fields, being gradients, can have only an $m$ component; thus $q_F = q_Q = 0$.

If the Lagrangian is stationary in $q$ at $q = 0$ (as is the case in the BI theory, and more generally in theories in which $q$ appears quadratically in the Lagrangian, e.g., forced by time-reversal invariance), which I assume, we have $\mathcal{U}_q(p_F, 0) = 0$. So, the field equations (43) can now be written (writing the current in terms of the Maxwellian solution) as

$[\mathcal{U}_p(p_F,0)F^{\mu k} - Q^{\mu k}]_{,k} = 0$. (53)

Similarly, for the PL theory the field equations (50) read

$[F^{\mu k} - \mathcal{S}_p(p_Q,0)Q^{\mu k}]_{,k} = 0$. (54)

Applying Gauss's theorem for the time component, and Ampere's theorem for the space components, shows that in both theories the expressions in brackets vanish for our static 1-D configurations:

$\mathcal{U}_p(p_F,0)F^{\mu k} = Q^{\mu k}$, (55)

for the nonlinear theory, and

$F^{\mu k} = \mathcal{S}_p(p_Q,0)Q^{\mu k}$, (56)

for the PL theory. [The fields $Q^{\mu k}$ are the same in both theories.] So, clearly, 1-D equivalence of the theories is tantamount to

$U'(p_F)S'(p_Q) = 1$, (57)

for the values of $p_F$ and $p_Q$ that correspond to each other (in either theory), and where I defined $U(p_F) \equiv \mathcal{U}(p_F,0)$, $S(p_Q) \equiv \mathcal{S}(p_Q,0)$.

To find the relation between $p_F$ and $p_Q$, contract each side of equations (55) and (56) with itself, to get

$[U'(p_F)]^2p_F = p_Q$, $[S'(p_Q)]^2p_Q = p_F$, (58)

respectively.

To recap, given $U$, if we choose $S$ so that eq.(57) is satisfied for $p_F$ and $p_Q$ related by either of equations (58), we get a PL theory with 1-D equivalence to the NL theory governed by $U$. Thus, $\mathcal{S}(p_Q,0)$ is determined uniquely (up to an additive constant), but not, of course, $\mathcal{S}(p_Q,q_Q)$.

For BI, where $U(x) = E_0^2(1 + 2x/E_0^2)^{1/2}$, one gets from the above requirement of similarity $S(y) = -E_0^2(1 - 2y/E_0^2)^{1/2}$ (up to a constant). So we get for the BI case $S(y) = -U(-y)$.
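
The BI pair quoted above can be checked numerically against conditions (57) and (58). The following is a minimal sketch, not part of the original derivation: the function names and the value $E_0 = 1$ are illustrative assumptions, and the sample arguments assume $p = (B^2 - E^2)/2$, which is what the BI form of $U$ above implies.

```python
import math

E0 = 1.0  # limiting field; the value is an arbitrary choice for this check

def U_prime(x):
    """Derivative of U(x) = E0^2 (1 + 2x/E0^2)^(1/2)."""
    return (1.0 + 2.0 * x / E0**2) ** -0.5

def S_prime(y):
    """Derivative of S(y) = -E0^2 (1 - 2y/E0^2)^(1/2)."""
    return (1.0 - 2.0 * y / E0**2) ** -0.5

def p_Q_from_p_F(p_F):
    """First relation in eq. (58): p_Q = [U'(p_F)]^2 p_F."""
    return U_prime(p_F) ** 2 * p_F

def similarity_product(p_F):
    """Left-hand side of eq. (57), evaluated at corresponding arguments."""
    return U_prime(p_F) * S_prime(p_Q_from_p_F(p_F))

# Negative p_F stands for electric-type fields (E < E0), positive for magnetic-type.
for p_F in (-0.45, -0.2, 0.0, 0.3, 5.0):
    print(p_F, similarity_product(p_F))
```

Each printed product should equal 1 up to rounding, and feeding the resulting $p_Q$ into the second relation of eq.(58) should return the original $p_F$.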



2. Second- order equivalence 

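Another way to constrain the dependence of $\mathcal{S}$ on $p$, $q$ is to require that the two theories coincide for a general problem up to next-to-lowest order in the fields (given that we require in the first place that both coincide with Maxwell's linear theory to lowest order).

Start with eq.(43), where we write $F_{\mu\nu} = Q_{\mu\nu} + H_{\mu\nu}$, with $Q_{\mu\nu}$ the Maxwellian solution. Subtracting the zeroth, Maxwellian order, we are left with

$-H^{\mu\nu}{}_{,\nu} = [\mathcal{U}^0_{pp}\,p_QQ^{\mu\nu} + \mathcal{U}^0_{pq}(q_QQ^{\mu\nu} + p_Q\,{}^*Q^{\mu\nu}) + \mathcal{U}^0_{qq}\,q_Q\,{}^*Q^{\mu\nu}]_{,\nu}$, (59)

where the lowest-order, Maxwellian solution is substituted everywhere on the right-hand side, and ${}^*Q^{\mu\nu}$ is the dual of $Q^{\mu\nu}$. Similarly, in the PL theory we get to this order

$H^{\mu\nu}{}_{,\nu} = [\mathcal{S}^0_{pp}\,p_QQ^{\mu\nu} + \mathcal{S}^0_{pq}(q_QQ^{\mu\nu} + p_Q\,{}^*Q^{\mu\nu}) + \mathcal{S}^0_{qq}\,q_Q\,{}^*Q^{\mu\nu}]_{,\nu}$. (60)

The two theories are then equivalent to this order in the fields if, in addition to $\mathcal{U}^0_p = \mathcal{S}^0_p$ and $\mathcal{U}^0_q = \mathcal{S}^0_q = 0$, we have $\mathcal{U}^0_{ij} = -\mathcal{S}^0_{ij}$ for the second derivatives at zero fields ($i, j \in \{p, q\}$), similar to the conditions (32) in the multi-scalar case.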
3. The Legendre connection

Unlike the single-scalar case, where the requirement of equivalence for static, 1-D configurations determines $S(y)$ given $U(z)$, here $\mathcal{S}(p_Q,0)$ is determined, given $\mathcal{U}(p_F,0)$, but not its dependence on $q_Q$. Also, unlike the single-scalar case, this condition alone does not ensure equivalence of the theories to next order in the fields; for this we further require conditions on the second derivatives of $\mathcal{S}$ at zero arguments.

Taking a cue from the scalar case, we choose $\mathcal{S}$ to be the Legendre transform of $\mathcal{U}$ in the six field variables$^8$ $F_{\mu\nu}$, or in $\mathbf{E}$ and $\mathbf{B}$. It is then seen that if we opt for a Palatini variation, namely, we consider the antisymmetric $Q_{\mu\nu}$ a basic DoF, without forcing it to be a curl, we get the genuinely NL theory (37-38). If, however, we do constrain $Q_{\mu\nu}$ to be the curl of a vector $B_\mu$, we get the PL kindred of this theory, as discussed above. This PL theory automatically satisfies the above two similarity requirements: One expresses the Hessians of $\mathcal{U}$ and $\mathcal{S}$ in terms of the partial derivatives of $\mathcal{U}$ and $\mathcal{S}$ in $p$ and $q$. Then one uses the fact that the Legendre connection implies that the Hessians are mutual inverses, for corresponding values of their variables, to show that: a. $U'(p_F)S'(p_Q) = 1$ (as well as various other relations), and b. at zero fields $\mathcal{U}^0_{ij} = -\mathcal{S}^0_{ij}$.



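For the BI case, start with $\mathcal{U}(\mathbf{E},\mathbf{B})$ given in eq.(42); define as usual

$\mathbf{D} \equiv -\frac{\partial\mathcal{U}}{\partial\mathbf{E}}$, $\mathbf{H} \equiv \frac{\partial\mathcal{U}}{\partial\mathbf{B}}$. (61)

Then, the Legendre transform of $\mathcal{U}(\mathbf{E},\mathbf{B})$ is

$\mathcal{S}(\mathbf{D},\mathbf{H}) = -\mathbf{D}\cdot\mathbf{E} + \mathbf{H}\cdot\mathbf{B} - \mathcal{U} = -E_0^2\left(1 - \frac{2p_Q}{E_0^2} - \frac{q_Q^2}{E_0^4}\right)^{1/2} = -E_0^2\left[1 + \frac{D^2 - H^2}{E_0^2} - \frac{(\mathbf{D}\cdot\mathbf{H})^2}{E_0^4}\right]^{1/2}$. (62)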



It is easily checked directly that with this choice of $\mathcal{S}$, both of the above conditions are satisfied: equivalence in 1-D configurations, and equivalence to second-lowest order in the fields for any configuration. As said above, these conditions do not determine $\mathcal{S}$ uniquely, as the first concerns only its dependence on $p_Q$ for $q_Q = 0$, and the second constrains only some derivatives at zero fields. But this choice of $\mathcal{S}$ might have additional advantages, which I have not yet pinpointed.
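
The closed form of the BI Legendre transform can also be verified numerically. The sketch below is illustrative only: it assumes $E_0 = 1$ and takes $\mathcal{U}(\mathbf{E},\mathbf{B}) = [1 + B^2 - E^2 - (\mathbf{E}\cdot\mathbf{B})^2]^{1/2}$, i.e., the BI function with any additive constant dropped; the helper names are hypothetical.

```python
import math
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def bi_U(E, B):
    """Assumed BI function U(E, B) with E0 = 1 (additive constant dropped)."""
    g = dot(E, B)
    return math.sqrt(1.0 + dot(B, B) - dot(E, E) - g * g)

def conjugate_fields(E, B):
    """D = -dU/dE and H = dU/dB, differentiated by hand from bi_U."""
    g, U = dot(E, B), bi_U(E, B)
    D = [(e + g * b) / U for e, b in zip(E, B)]
    H = [(b - g * e) / U for e, b in zip(E, B)]
    return D, H

def legendre_S(E, B):
    """S = -D.E + H.B - U, evaluated through the original variables."""
    D, H = conjugate_fields(E, B)
    return -dot(D, E) + dot(H, B) - bi_U(E, B)

def closed_form_S(D, H):
    """Closed form in D and H: -[1 + D^2 - H^2 - (D.H)^2]^(1/2), E0 = 1."""
    return -math.sqrt(1.0 + dot(D, D) - dot(H, H) - dot(D, H) ** 2)

random.seed(0)
for _ in range(5):
    E = [random.uniform(-0.4, 0.4) for _ in range(3)]
    B = [random.uniform(-0.4, 0.4) for _ in range(3)]
    D, H = conjugate_fields(E, B)
    print(legendre_S(E, B), closed_form_S(D, H))
```

The two printed columns should agree to rounding error. A by-product of the same algebra is $\mathbf{D}\cdot\mathbf{H} = \mathbf{E}\cdot\mathbf{B}$, which can be checked with the same helpers.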

The BI Lagrangian is special in that it has the same form as its Legendre transform, only with the opposite sign of $E_0^2$ [compare with eq.(42)]. In other words, $\mathcal{S}(Q_{\mu\nu}, E_0^2) = \mathcal{U}(Q_{\mu\nu}, -E_0^2)$. Or, in determinant form,

$\mathcal{S} = -E_0^2(-\|\eta_{\mu\nu} + iQ_{\mu\nu}/E_0\|)^{1/2}$. (63)



IV. A TOY EXAMPLE: SPECIAL-RELATIVISTIC KINEMATICS 



Consider now the construction of an analog PL theory for the even simpler problem of point particles in Minkowski space-time, with world lines $x_k^\mu(\tau)$ ($k$ is the particle index). The action is

$I = -\sum_km_k\int d\tau_k + I_{int}$, (64)

$I_{int}$ being the interaction action, which depends on the $x_k^\mu(\tau)$. Pick some Lorentz frame in which the particle trajectories in space are $\mathbf{x}_k(t)$, for which the free-particle action is $-\sum_km_k\int dt\,(1 - v_k^2)^{1/2}$ (with $\mathbf{v}_k = \dot{\mathbf{x}}_k$). The equations of motion are $m_k\,d[(1 - v_k^2)^{-1/2}\mathbf{v}_k]/dt = \mathbf{F}_k(\mathbf{x}_1, \mathbf{x}_2, ...) = \delta I_{int}/\delta\mathbf{x}_k$. In the PL analog we add auxiliary degrees of freedom $\mathbf{y}_k(t)$, and the action is

$I = \sum_km_k\int dt\,[\dot{\mathbf{x}}_k\cdot\dot{\mathbf{y}}_k - \frac{1}{2}\lambda(\dot{\mathbf{y}}_k^2)] + I_{int}$, (65)

where the interaction depends only on $\mathbf{x}_k$. Varying over $\mathbf{x}_k$ and $\mathbf{y}_k$, respectively, gives

$m_k\ddot{\mathbf{y}}_k = \mathbf{F}_k$, $\ddot{\mathbf{x}}_k = d[\lambda'(\dot{\mathbf{y}}_k^2)\dot{\mathbf{y}}_k]/dt$. (66)
8 Not in $p$ and $q$, and not, e.g., with respect to $\mathbf{E}$ alone, which would give the Hamiltonian of the theory.

In other words, $\mathbf{y}_k(t)$ is the solution of Newton's equations for the same run of the forces on the particles as in the relativistic problem$^9$, and once it is solved for and substituted in the right-hand side of the second equation, we have another Newtonian equation to solve.

To lowest order in the velocities, the free Lagrangian (for a unit mass, and dropping the particle index from now on) is $L_K \approx \dot{\mathbf{x}}^2/2 - (\dot{\mathbf{x}} - \dot{\mathbf{y}})^2/2$. We see, again, that the difference degree of freedom has the "ghost-like" sign of the kinetic term, but that it decouples altogether from the other DoFs. Is this a bad sign for the theory?

The second of eqs.(66) integrates to $\dot{\mathbf{x}} = \lambda'\dot{\mathbf{y}}$ (with the appropriate initial conditions). We see that the two velocities are always parallel, and squaring this relation, $\dot{\mathbf{x}}^2 = [\lambda'(\dot{\mathbf{y}}^2)]^2\dot{\mathbf{y}}^2$, gives an algebraic relation between their magnitudes; so $\dot{\mathbf{y}}$ can be algebraically expressed in terms of $\dot{\mathbf{x}}$. It is seen that if we take $\lambda(z) = 2(1 + z)^{1/2}$ [$-\lambda/2$ is the Legendre transform of $(1 - v^2)^{1/2}$], and plug the expression of $\dot{\mathbf{y}}^2$ in terms of $\dot{\mathbf{x}}^2$ back in the action, eliminating the dependence on $\mathbf{y}_k$, we get the standard Lorentz Lagrangian (64). So, in the above chosen frame, the action

$I = \sum_km_k\int dt\,[\dot{\mathbf{x}}_k\cdot\dot{\mathbf{y}}_k - (1 + \dot{\mathbf{y}}_k^2)^{1/2}] + I_{int}$ (67)

gives an equivalent theory. This action is not Lorentz-invariant, but gives invariant equations of motion for the $x_k^\mu(\tau)$.

For the kinetic Hamiltonian, $H_K = \sum_iP_i\dot{\xi}_i - L_K$, where $P_i = \partial L/\partial\dot{\xi}_i$ [$\xi = (\mathbf{x}, \mathbf{y})$], we find $H_K = \dot{\mathbf{x}}\cdot\dot{\mathbf{y}} - \lambda'\dot{\mathbf{y}}^2 + \lambda/2$. For solutions of the equations of motion the first two terms cancel and we are left with $H_K = \lambda/2 = (1 + \dot{\mathbf{y}}^2)^{1/2}$, which is always positive. We can also express it in terms of $\dot{\mathbf{x}}^2$, through the algebraic relation, which gives the standard special-relativistic energy $H_K = (1 - \dot{\mathbf{x}}^2)^{-1/2}$. So despite the alarming appearance of ghost-like terms in the linear limit, the theory is stable and otherwise healthy. The two theories have in fact the same solutions under the same initial conditions.
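
The on-shell statements of this section are easy to confirm numerically. The following sketch (unit mass, $c = 1$, speeds only since the velocities are parallel; all function names are illustrative) evaluates the PL kinetic Lagrangian and Hamiltonian for $\lambda(z) = 2(1 + z)^{1/2}$ and compares them with the special-relativistic expressions.

```python
import math

def lam(z):
    """lambda(z) = 2 (1 + z)^(1/2), the choice made in the text."""
    return 2.0 * math.sqrt(1.0 + z)

def lam_prime(z):
    return 1.0 / math.sqrt(1.0 + z)

def x_speed(y_speed):
    """On-shell relation x_dot = lambda'(y_dot^2) y_dot for parallel velocities."""
    return lam_prime(y_speed**2) * y_speed

def pl_lagrangian(y_speed):
    """Unit-mass kinetic Lagrangian x_dot.y_dot - lambda(y_dot^2)/2, on shell."""
    return x_speed(y_speed) * y_speed - 0.5 * lam(y_speed**2)

def pl_hamiltonian(y_speed):
    """H_K = x_dot.y_dot - lambda' y_dot^2 + lambda/2, evaluated on shell."""
    z = y_speed**2
    return x_speed(y_speed) * y_speed - lam_prime(z) * z + 0.5 * lam(z)

for y_speed in (0.0, 0.5, 2.0, 50.0):
    v = x_speed(y_speed)
    print(v, pl_lagrangian(y_speed), -math.sqrt(1 - v * v),
          pl_hamiltonian(y_speed), 1 / math.sqrt(1 - v * v))
```

In each row the second and third entries (Lagrangians) and the fourth and fifth entries (energies) should coincide, while the physical speed in the first column stays below 1 however large the auxiliary speed is.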

The identity of the two theories follows immediately from the fact that $t$ is the only independent variable of the degrees of freedom; so the constraint that would differentiate between the theories is not a real constraint. The PL construction is not of practical use in this easily integrable case, but it is a useful heuristic example.



V. EXAMPLE: FIELDS AND FORCES IN NONLINEAR ELECTROSTATICS 

I now look more closely at the specific example of BI electrostatics, to highlight some of the similarities and 
dissimilarities between this theory and its kindred PL theory. Some related issues in the context of the BI theory 
were discussed in [13]. 

Consider an electrostatic configuration made of a charge distribution $\rho(\mathbf{r})$ in 3-D Euclidean space. The action for a general, genuinely NL theory of electrostatics, with one potential, to which the general Lagrangian (37) leads, can be written as

$I = \int\{-u[(\nabla\phi)^2] - \rho\phi\}\,d^3r$, (68)

where $\phi$ is the electrostatic potential ($\mathbf{E} = -\nabla\phi$). [I have changed the notation a little: here it is convenient to use $x = (\nabla\phi)^2$ and $y = (\nabla\psi)^2$ as variables, and the Lagrangian functions for the two types of theory will be denoted $u(x)$ and $s(y)$. I also choose the arbitrary additive constant in $u$ and $s$ so that $u(0) = s(0) = 0$.] The field equation is

$2\nabla\cdot(u'\nabla\phi) = 2u'\left(\delta_{ij} + 2\hat{u}'\frac{\phi_{,i}\phi_{,j}}{|\nabla\phi|^2}\right)\phi_{,ij} \equiv \mathcal{H}^u_{ij}\phi_{,ij} = \rho$, (69)

where $\mathcal{H}^u_{ij}$ is the Hessian of $u$ with respect to the variables $\phi_{,i}$, and $\hat{u}' \equiv xu''(x)/u'(x)$ is the logarithmic derivative of $u'$.

If the theory has the standard, linear, weak-field limit, as I assume in what follows$^{10}$, $u(x) \approx -x/2$ for $x \to 0$, and thus $\hat{u}'(0) = 0$. Ellipticity of the field equation, which I require, is tantamount to $\mathcal{H}^u_{ij}$ being regular. Since one of its eigenvalues is $\propto 1 + 2\hat{u}'$, which must not vanish, we have that $\hat{u}'(x) > -1/2$ for all $x$. Another useful inequality follows from this, with our normalization $u(0) = 0$: $\hat{u} \equiv xu'/u > 1/2$. To see this consider the function $w(x) \equiv xu'(x) - u(x)/2 = u(\hat{u} - 1/2)$. We have $w(0) = 0$, and $w'(x) = u'(\hat{u}' + 1/2) < 0$, by virtue of the above inequality for $\hat{u}'$ and the fact that $u' < 0$. So $w(x) \le 0$, and vanishes only for $x = 0$. Thus $w/u > 0$ for $x > 0$ (since $u < 0$), leading to the required inequality.

9 This means that to get $\mathbf{y}_k(t)$ we first have to know the $\mathbf{x}_k(t)$, calculate for these the time runs of the forces $\mathbf{F}_k$, and calculate the $\mathbf{y}_k(t)$ as the Newtonian trajectories for these forces. Alternatively, we could have considered a problem in which the forces $\mathbf{F}_k(t)$ are dictated in some Lorentz frame, instead of the interactions. Then $\mathbf{y}_k(t)$ are the Newtonian trajectories for these forces, and $\mathbf{x}_k(t)$ are the special-relativistic ones.

10 Since we are dealing with electrostatics, $u$ is negative, so as to give repulsion for like charges. In NL theories of gravity, such as MOND, $u$ is positive.

The corresponding PL action, gotten from the Lagrangian (44), is

$I = \int\{\nabla\phi\cdot\nabla\psi + s[(\nabla\psi)^2] - \rho\phi\}\,d^3r$. (70)

Variation over $\phi$ and $\psi$, respectively, gives

$\Delta\psi = -\rho$, $\Delta\phi + 2\nabla\cdot(s'\nabla\psi) = \Delta\phi + 2s'\left(\delta_{ij} + 2\hat{s}'\frac{\psi_{,i}\psi_{,j}}{|\nabla\psi|^2}\right)\psi_{,ij} \equiv \Delta\phi + \mathcal{H}^s_{ij}\psi_{,ij} = 0$, (71)

where $\mathcal{H}^s_{ij}$ is the Hessian of $s$, and $\hat{s}' \equiv ys''(y)/s'(y)$.

We saw that in this single-potential case, the requirement of equivalence for 1-D configurations fixes $s$ uniquely as the Legendre transform of $u$ (in the variables $\nabla\phi$, $\nabla\psi$). Thus, $\mathcal{H}^u_{ij}$ and $\mathcal{H}^s_{ij}$ are mutual inverses at the corresponding values of the variables, and the inequalities $\hat{s}' > -1/2$, $\hat{s} \equiv ys'/s > 1/2$ apply for $s$, as for $u$.

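Unlike the field equations (69), which generally require numerical solution, the solution of the PL theory can be written in closed form as space integrals: For example, for a system of point charges $q_i$ at $\mathbf{r}_i$,

$\psi(\mathbf{r}) = \frac{1}{4\pi}\sum_i\frac{q_i}{|\mathbf{r} - \mathbf{r}_i|}$, (72)

and

$\phi(\mathbf{r}) = \frac{1}{4\pi}\int\frac{\nabla\cdot\mathbf{A}(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\,d^3r'$, (73)

where $\mathbf{A} \equiv 2s'\nabla\psi$. Integrating by parts gives other useful expressions (for theories with standard weak-field limit, as I assume here, the surface integral at infinity vanishes):

$\phi(\mathbf{r}) = \frac{1}{4\pi}\int\mathbf{A}(\mathbf{r}')\cdot\frac{\mathbf{r}' - \mathbf{r}}{|\mathbf{r}' - \mathbf{r}|^3}\,d^3r'$, (74)

and for the field

$\nabla\phi(\mathbf{r}) = \frac{1}{4\pi}\int\nabla\cdot\mathbf{A}(\mathbf{r}')\frac{\mathbf{r}' - \mathbf{r}}{|\mathbf{r}' - \mathbf{r}|^3}\,d^3r'$, (75)

and after integration by parts

$\nabla\phi(\mathbf{r}) = -\frac{1}{4\pi}\int\frac{d^3r'}{|\mathbf{r}' - \mathbf{r}|^3}\left[I - 3\frac{(\mathbf{r}' - \mathbf{r})\otimes(\mathbf{r}' - \mathbf{r})}{|\mathbf{r}' - \mathbf{r}|^2}\right]\cdot\mathbf{A}(\mathbf{r}')$, (76)

where $I$ is the unit matrix. Changing variables:

$\nabla\phi(\mathbf{r}) = -\frac{1}{4\pi}\int\frac{d^3\bar{r}}{\bar{r}^3}\left[I - 3\frac{\bar{\mathbf{r}}\otimes\bar{\mathbf{r}}}{\bar{r}^2}\right]\cdot\mathbf{A}(\mathbf{r} + \bar{\mathbf{r}})$. (77)

This converges at infinity because $\mathbf{A}$ vanishes there, and at $\bar{\mathbf{r}} = 0$ because the angular integrals for constant $\mathbf{A}$ are of spherical harmonics, and vanish.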
For BI electrostatics,

$u(x) = E_0^2[(1 - x/E_0^2)^{1/2} - 1]$, $s(y) = E_0^2[1 - (1 + y/E_0^2)^{1/2}]$; (78)

so $\mathbf{A} = -\nabla\psi/(1 + |\nabla\psi|^2/E_0^2)^{1/2}$. Interestingly, $s$ is then the Lagrangian for the volume extremization problem. Taking from now on $E_0 = 1$, we have here $\hat{s}'(y) = -y/[2(1 + y)]$.
The phantom densities in the two theories are

$\rho_p = \rho[(1 - E^2)^{1/2} - 1] - \frac{E}{1 - E^2}\mathbf{E}\cdot\nabla|\mathbf{E}|$ (79)

in BI, while in the PL theory it is

$\rho_p = \rho\left[\frac{1}{(1 + E_c^2)^{1/2}} - 1\right] - \frac{E_c}{(1 + E_c^2)^{3/2}}\mathbf{E}_c\cdot\nabla|\mathbf{E}_c|$, (80)

where $\mathbf{E}_c$ is the Coulomb field. The total phantom charge vanishes in both theories.

The two theories are equivalent for 1-D configurations, in which case expressions (79) and (80) are seen to be the same.
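
As an illustration of this 1-D coincidence, the sketch below evaluates both phantom densities in the vacuum around a single point charge, and compares them with the divergence of the common field $E = E_c(1 + E_c^2)^{-1/2}$. It is only a numerical check under stated assumptions: $E_0 = 1$, the constant `K` stands for $q/4\pi$ with an arbitrary value, and radial derivatives are approximated by central finite differences.

```python
import math

K = 1.0  # q / (4 pi) for the point charge, in units with E0 = 1 (arbitrary choice)

def coulomb(r):
    return K / r**2

def field(r):
    """Radial field shared by BI and its PL kindred for a point charge."""
    Ec = coulomb(r)
    return Ec / math.sqrt(1.0 + Ec**2)

def ddr(f, r, h=1e-5):
    """Central finite difference, standing in for the radial derivative."""
    return (f(r + h) - f(r - h)) / (2.0 * h)

def phantom_bi(r):
    """BI phantom density in vacuum (rho = 0), for a radial field."""
    E = field(r)
    return -E / (1.0 - E**2) * E * ddr(field, r)

def phantom_pl(r):
    """PL phantom density in vacuum (rho = 0), for a radial field."""
    Ec = coulomb(r)
    return -Ec / (1.0 + Ec**2) ** 1.5 * Ec * ddr(coulomb, r)

def divergence_of_field(r):
    """div E = r^-2 d(r^2 E)/dr, which equals rho_p away from the charge."""
    return ddr(lambda s: s**2 * field(s), r) / r**2

for r in (0.3, 1.0, 3.0):
    print(r, phantom_bi(r), phantom_pl(r), divergence_of_field(r))
```

The three columns after the radius should agree to within the finite-difference error.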

A. Integral relations for the PL theory 

There are several useful integral relations. First note that integrating the second of eqs.(71) for $\phi$ over any volume bounded by a surface $S$, we have

$\int_S\nabla\phi\cdot d\boldsymbol{\sigma} = -2\int_Ss'\nabla\psi\cdot d\boldsymbol{\sigma}$, (81)

where the integrands are evaluated on the surface. Now, multiply that equation by $\psi$, and integrate over a volume $V_\psi$, bounded by an equipotential surface, $S$, of $\psi$. Integrating by parts, the surface integrals are equal, by relation (81), and cancel, and we get

$\int_{V_\psi}\nabla\phi\cdot\nabla\psi\,dV = -2\int_{V_\psi}s'|\nabla\psi|^2\,dV$. (82)

A useful corollary is that for any such volume, the integral of the free-field Lagrangian density in expression (70) is

$I_f(V) = \int_{V_\psi}(\nabla\phi\cdot\nabla\psi + s)\,d^3r = \int_{V_\psi}(s - 2s'|\nabla\psi|^2)\,d^3r = \int_{V_\psi}s(1 - 2\hat{s})\,d^3r$. (83)

This integral is nonnegative, and vanishes only if $\nabla\psi = 0$ everywhere in the volume, since $\hat{s}(z) > 1/2$, as shown above, and $s \le 0$ with equality only at $z = 0$. This applies, in particular, when integrating over the whole space.

Now multiply the field equation by $\phi$, and integrate in a volume, $V_\phi$, bounded by an equipotential of $\phi$. We then get in the same way

$\int_{V_\phi}(\nabla\phi)^2 = -\int_{V_\phi}2s'\nabla\phi\cdot\nabla\psi$. (84)

For $V$ the whole space, both equalities apply.

B. Properties of the field 

Much is known about the solutions of elliptic equations of the type (69) (see, e.g., [14]). Paradoxically, even though the PL kindred is simpler to solve, I am not aware of discussions of the analogous properties for it. I now discuss briefly several of these properties, where I mainly pose questions and suggest some insights pertaining to possible answers.

1. Extrema of the potential

It is well known that solutions of eq.(69) cannot attain extrema in vacuum, except on boundaries (this is known as a maximum principle [14]). This means, for example, that we cannot suspend a test charge in an electrostatic field, outside source charges$^{11}$; is this also true for the 'physical' potential, $\phi$, in the PL analog? (It is true for $\psi$, of course.)

I was not able to find an answer to this in the mathematical literature. The potential $\phi$ satisfies the Poisson equation with $\rho + \rho_p$ as source, but while $\rho$ may be localized on a finite support, $\rho_p$ is not, in general. We see from eq.(80) that this density does vanish where $\rho = \mathbf{E}_c\cdot\nabla|\mathbf{E}_c| = 0$. For example, this holds at all the critical points of the Coulomb potential of a system of point charges (where $\mathbf{E}_c = 0$). So, for example, $\phi$ does not attain an extremum at symmetry points (such as the midpoint between equal charges). It is also true in 1-D configurations.



11 It was shown in [15] that it is possible to suspend a non-test charge, or a rigid body of test charges, in vacuum in certain configurations.

2. Boundedness of the electric field

In BI, the electric field strength $E = |\nabla\phi|$ is bounded from above by 1 ($E_0$); is this also the case in the PL theory? I have not been able to answer this question in general.

In the 1-D configurations this is clearly the case, since then $E = |\mathbf{A}| = E_c(1 + E_c^2)^{-1/2}$. So, $E$ is indeed bounded, and $E \to 1$ when approaching source singularities, such as point charges. But is this always true? In general we only know that $\nabla\cdot\mathbf{E} = \nabla\cdot\mathbf{A}$, from which we can derive average upper limits for $E$. For example, eq.(81) tells us that for any closed surface

$-\int\nabla\phi\cdot d\boldsymbol{\sigma} = \int\mathbf{A}\cdot d\boldsymbol{\sigma} \le \int|\mathbf{A}|\,d\sigma < \int d\sigma$. (85)

If the surface is an equipotential of $\phi$ (on which $-\nabla\phi\cdot d\boldsymbol{\sigma} = E\,d\sigma$) we have

$\int E\,d\sigma < \int d\sigma$. (86)

In other words, the area-weighted average of $E$ on any closed, $\phi$-equipotential surface does not exceed 1. Also, from eq.(84), we have for a volume within such a surface

$\int_VE^2 = \int_V\mathbf{E}\cdot\mathbf{A} \le \int_VE|\mathbf{A}| < \int_VE$. (87)

This means $\int_VE(E - |\mathbf{A}|) \le 0$; so $E$ cannot exceed $|\mathbf{A}|$ everywhere in the volume.



C. Forces on bodies



Consider the force, $\mathbf{F}_V$, acting on a subsystem of $\rho(\mathbf{r})$ made of all the charge within some sub-volume $V$. Some relevant results were derived in [16] for theories governed by NL actions of type (68), and in [6] for theories of type (70). $\mathbf{F}_V$ is writable as an integral over any closed surface, $S$, that surrounds all the charges in $V$, and excludes all others:

$\mathbf{F}_V = \int_S\mathsf{P}\cdot d\boldsymbol{\sigma}$, (88)

where $\mathsf{P}$ is the stress tensor defined as the functional derivative of the free-field action with respect to the background metric, and $d\boldsymbol{\sigma}$ points outward of $V$. More specifically,

$\delta I_{fields} = \frac{1}{2}\int g^{1/2}P_{ij}\,\delta g^{ij}$. (89)

(To identify $P_{ij}$ we write the action on a curved background, after which we can specialize back to a flat background.) For the genuinely NL theory (68) this gives

$\mathbf{F}_V = \int_Su\,d\boldsymbol{\sigma} - 2u'\nabla\phi(\nabla\phi\cdot d\boldsymbol{\sigma})$. (90)

[This is correct provided $u(0) = 0$; otherwise $u - u(0)$ appears instead of $u$.] For the PL theory (70) we get [again, provided $s(0) = 0$]

$\mathbf{F}_V = \int_S-(s + \nabla\phi\cdot\nabla\psi)\,d\boldsymbol{\sigma} + 2s'\nabla\psi(\nabla\psi\cdot d\boldsymbol{\sigma}) + \nabla\psi(\nabla\phi\cdot d\boldsymbol{\sigma}) + \nabla\phi(\nabla\psi\cdot d\boldsymbol{\sigma})$. (91)

For example, to calculate the force between two equal charges we can choose $S$ as the symmetry plane completed by a hemisphere at infinity, on which the integral vanishes. From symmetry, $\nabla\psi\cdot d\boldsymbol{\sigma} = \nabla\phi\cdot d\boldsymbol{\sigma} = 0$, and $\nabla\phi \parallel \nabla\psi$ on the midplane, and we have for the two theories, respectively,

$\mathbf{F}_V = \int u\,d\boldsymbol{\sigma}$, $\mathbf{F}_V = -\int(s + |\nabla\phi||\nabla\psi|)\,d\boldsymbol{\sigma}$. (92)

To calculate the force in the nonlinear theory we first need to solve the field equation (69) given $\rho$ and boundary conditions ($\nabla\phi \to 0$) at infinity. We then use the result in the integral (90). The calculation in the PL theory is more straightforward, as we have a closed form for the integrand in expression (91).
1. Force on a spherical charge by a weak charge distribution

Consider a system made of an arbitrary 1-D charge distribution $\rho_s$ (e.g., spherical; e.g., a point charge), and an arbitrary charge distribution $\rho$ that is so weak that we can treat it as a test-charge distribution. Then, the force on $\rho$ (by momentum conservation, also minus the force on $\rho_s$) is easily calculated in both theories (e.g., [15]). This force is simply $\int\rho\mathbf{E}_s\,d^3r$, where $\mathbf{E}_s$ is the electric field produced by $\rho_s$ alone. Since $\rho_s$ is 1-D, $\mathbf{E}_s$ is easily calculated, and furthermore, it is the same in the two theories. For example, it can be shown, based on this, that a point charge $q_s$ can be suspended at the center of a cube, at the corners of which we have charges $q$ of opposite sign, with $|q| \ll |q_s|$.



2. Attraction and repulsion between bodies of uniform charge sign 

In [16] I formulated a push-pull conjecture pertaining to the question of attraction and/or repulsion between bodies each made of a charge distribution of a uniform sign. In the present context: Suppose we have two parallel planes, say parallel to the $x$-$y$ plane, with a charge distribution $\rho_1 > 0$ between the planes, $\rho_2 > 0$ to the right of the two planes, and $\rho_3 < 0$ to their left. Then, in a theory like BI, it was conjectured that the force on $\rho_1$ is always to the left. This conjecture implies, e.g., that bodies of uniform-sign charge separated by a plane always repel each other if they have the same sign (namely the force on each of the bodies points to the other side of any separating plane), and attract for opposite signs. The general conjecture is trivial to prove for the linear, Coulomb electrostatics. But I was able to prove only special cases of this conjecture for BI electrostatics (for example, the case where $\rho_1$ is spherical and monotonically decreasing from its center out). There is an even more elementary result that holds for BI: If $\rho > 0$ ($< 0$) are all the charges in space, and $C$ is the convex closure of the support of $\rho$ ($C$ is the smallest convex volume containing all points where $\rho \ne 0$), then, at any point outside $C$ the field $\mathbf{E}$ points away from (into) $C$. This I proved in [16] using a comparison principle for an equation like eq.(68).

For the PL kindred I was not able to prove even these special cases. Of course, if one of the three bodies is 1-D, and the other two are test bodies, the conjecture is easily seen to be correct in the PL theory.



3. The two-body force 

One of the interesting problems in NL electrostatics is the calculation of the force between two point charges. On dimensional grounds, the force between charges $q_1$ and $q_2$, a distance $\ell$ apart, can be written as (reinstating $E_0$)

$F(q_1, q_2, \ell) = \frac{q_1q_2}{4\pi\ell^2}f(\lambda, \zeta)$ (93)

(positive for repulsion), where the dimensionless variables are $\lambda = (|q_1| + |q_2|)/4\pi E_0\ell^2$, and the charge ratio $\zeta = q_1/q_2$ ($|\zeta| \le 1$). It was proven in [16] that, with the choice of the sign of the free-field action in expression (68), point charges of the same (opposite) sign repel (attract) each other, so $f > 0$ (the opposite is true when the sign of the action is inverted, as in MOND gravity). The weak-field limit applies for $\lambda \to 0$, where we have for both theories$^{12}$ $f(0, \zeta) = 1$. The fact that the two theories are the same to second order tells us that $f$ for the two theories are the same also to first order in $\lambda$. The limit $\zeta \to 0$ corresponds to $q_1$ being a test charge in the spherical field of $q_2$, which is known analytically. Thus, the two theories coincide and give $f(\lambda, 0) = (1 + \lambda^2)^{-1/2}$. For the limit $\lambda \to \infty$ both theories give a constant force, and we can write then $f(\lambda \to \infty, \zeta) \to \lambda^{-1}\bar{f}(\zeta)$.

In VII, I calculate $\bar{f}(\zeta)$ analytically, in BI electrostatics, for charges of the same sign, for any dimension. For 3-D, I find a repulsive force

$F \to E_0\frac{q_1q_2}{|q_1 + q_2|}$, (94)

that is, $\bar{f}(\zeta) = 1$.

In two dimensions, or for parallel lines of uniform charge (where the charges and the force are per unit length), I get in this limit

$F(q_1, q_2, \ell \to 0) \to \frac{E_0|q_1 + q_2|}{\pi}\sin\left(\pi\frac{q_1}{q_1 + q_2}\right)$. (95)

This is the same as the result in [17], obtained using a method that applies only in 2-D.

For the PL theory I have not yet been able to derive similar results, as the specific method used in the BI case does not directly apply to it.

12 The fact that the field is strong near the point charges is immaterial. When $\lambda \ll 1$ we can choose the integration surface in eqs.(90)(91) where the field is weak, and the force attains its weak-field limit (this needs more careful showing, but is correct).

VI. DISCUSSION 

I have presented a class of theories that, on one hand, describe nonlinear physics, but which require solving only linear differential equations. There are examples where such theories, when properly constructed, serve as very useful approximations for genuinely NL physical systems. However, much still remains to be checked if these theories are to stand by themselves, not only as approximations or heuristic tools, but, e.g., as theories of gravity generalizing standard gravity (as in the MOND paradigm), or as theories of electromagnetism.

We saw, for example, that ghosts appear in the weak-field limit of all the versions of the PL theories that have a linear weak-field limit. But it has to be checked how deleterious they are. In the linear approximation these ghosts decouple altogether and do not affect physics; and it is not clear to what extent they survive nonperturbatively. For example, in the PL version of special-relativistic kinematics, where (decoupled) ghosts do appear in the linear limit, they seem to be harmless, as the theory in full is equivalent to special relativity, and, as we saw, its Hamiltonian is always positive. We also saw that with a proper choice of the Lagrangian, these theories are equivalent to healthy theories, up to second order in the fields. It is also not clear to what extent such a problem arises in theories with no linear weak-field limit (for example, in MOND, the Lagrangian is nonanalytic in the fields in the weak-field limit). Another fact from which we may draw hope, in this connection, is that the modes that appear ghostlike in the linear limit seem to be sourced by the phantom densities. These, in themselves, are not independent of the actual charges; so it may be that a system never actually radiates negative energy to infinity, which is the basic problem with ghosts.

Another subject for further study is the generalization of the concepts here to other NL theories. The construction of PL theories as described above hinges on the Lagrangian depending nonlinearly only on the first derivatives of the basic DoF. Can a sensible generalization be made to Lagrangians that depend also (nonlinearly) on the DoF themselves, or on higher derivatives? For example, for a single scalar, we may consider, instead of genuinely NL theories governed by a field Lagrangian of the form

$\mathcal{L} = -U(\phi_{,\mu}; \phi_{,\mu,\nu})$, (96)

PL theories with 

This leads to linear, higher-derivative field equations 

However, the affinity between the two theories is less clear. Even if we choose $S(e_\mu, e_{\mu\nu})$ to be the Legendre transform of $U$, the two theories will not coincide, in general, for 1-D configurations: it is true that in this case $e_\mu = \psi_{,\mu}$ for some $\psi$, and $e_{\mu\nu} = \bar{\psi}_{,\mu,\nu}$ for some $\bar{\psi}$, but the PL theory requires the further constraint $\psi = \bar{\psi}$.

Other possible generalizations are to PL analogs of generalizations of BI, e.g., certain versions of Dirac-Born-Infeld in Minkowski space-time, and in (fixed) curved space-time. These involve a set of scalar fields $\Phi^a$, and in the Minkowski case we take

C = El[l - (-|k„ + + iW£ ||) 1/2 ], (99) 

with summation over double indices. The PL theory involves also fields $\Psi^a$, and the Lagrangian is

-D$ a • D*° - \f^Q^ + S(^, Q^). (100) 

There are also many interesting questions regarding properties of these theories and the extent of their similarity to their genuinely NL kindred.



(97) 



(98) 
VII. APPENDIX: THE SHORT-DISTANCE FORCE IN BORN-INFELD ELECTROSTATICS



Consider two charges of the same sign, $q_2$ at the origin, and $q_1$ ($|q_1| \le |q_2|$) at $x = \ell$ on the $x$ axis, in three (space) dimensions. We seek to calculate the force between them in BI electrostatics, in the limit $\ell \to 0$. The field $\nabla\phi$ has three finite critical points: two on the charges, and one in between them where $\nabla\phi = 0$, at point $o$. Take as integration surface, $\Sigma$, for calculating the force on $q_1$ from eq.(90), the "watershed" surface, which passes through $o$, and which separates the field lines ending on the two charges, completed at infinity on the side of $q_1$. On $\Sigma$, $\nabla\phi$ is tangent to $\Sigma$. It was shown in [16] that for a charge distribution of a uniform sign, $\nabla\phi$ everywhere points to (or away from) the convex closure of the charge distribution, in our case the segment connecting the two charges. Thus all field lines become radial at large distances $r \gg \ell$, and $\Sigma$ approaches a cone of half opening angle $\theta$. Apply Gauss's theorem to eq.(69) in the volume within $\Sigma$, which contains only $q_1$. Only the integral at infinity contributes, and we get that the solid angle subtended by $\Sigma$ at infinity, $\Omega$, is given by $\Omega/4\pi = (1 - \cos\theta)/2 = q_1/(q_1 + q_2)$.

Now look at expression (90) for the force. The contribution to it from infinity is seen to vanish. On the rest of $\Sigma$, $\nabla\phi$ is perpendicular to $d\boldsymbol{\sigma}$; so we have

$\mathbf{F} = \int_\Sigma u\,d\boldsymbol{\sigma}$. (101)

Since $u < 0$ the force is repulsive. (Since the tangent to $\Sigma$ always points to the inter-charge segment, the $x$ component of $d\boldsymbol{\sigma}$ is everywhere nonpositive.) Divide the integral into the contributions from radii $r < \kappa\ell$, and $r > \kappa\ell$, with $\kappa \gg 1$ fixed. Since $|u|$ is bounded by $E_0^2$, the first contribution is bounded by a quantity of order $E_0^2(\kappa\ell)^2$, which vanishes in the limit $\ell \to 0$. Beyond $\kappa\ell$ the two charges are seen approximately as one charge $q_1 + q_2$, the field is radial, and to a very good approximation $u(|\nabla\phi|^2)$ may be replaced by its known expression for the spherical, point-charge case:

$u = -E_0^2[1 - (1 + y/E_0^2)^{-1/2}]$, (102)

where $y = [(q_1 + q_2)/4\pi r^2]^2$ is the Coulomb field squared, and $r$ is the distance from the origin. This approximation is arbitrarily good for large enough $\kappa$. The integral beyond $\kappa\ell$ can now be extended back to the origin with expression (102) for $u$, again because the integral from the origin to $\kappa\ell$ vanishes in the limit $\ell \to 0$. The expression for the force in the limit is then the integral (101), with $u$ from eq.(102), calculated on the circular-cross-section cone of half opening angle $\theta$, around the $x$ axis. From symmetry, only the $x$ component is finite, and in the limit $\ell \to 0$ is

$F(q_1, q_2, \ell \to 0) \to 2\pi\sin^2\theta\,E_0^2\int_0^\infty r\,dr\,[1 - (1 + y/E_0^2)^{-1/2}]$. (103)

Integrating, and substituting the expression for $\theta$ in terms of the charges, we get

$F(q_1, q_2, \ell \to 0) \to E_0\frac{q_1q_2}{|q_1 + q_2|}$. (104)
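
The step from the cone integral to the closed form can be checked by direct quadrature. The sketch below is only a numerical illustration for positive charges: the function names, the midpoint rule, and the cutoff radius are assumptions made here, and the infinite upper limit is replaced by a large multiple of the radius $a = [Q/(4\pi E_0)]^{1/2}$ at which the Coulomb field equals $E_0$.

```python
import math

def cone_sin2(q1, q2):
    """sin^2(theta) from (1 - cos theta)/2 = q1/(q1 + q2)."""
    cos_theta = 1.0 - 2.0 * q1 / (q1 + q2)
    return 1.0 - cos_theta**2

def radial_integral(Q, E0, r_max_factor=300.0, n=300_000):
    """Midpoint-rule estimate of int_0^inf r [1 - (1 + y/E0^2)^(-1/2)] dr.

    y = (Q / (4 pi r^2))^2; the range is cut at r_max_factor * a, where
    a = sqrt(Q / (4 pi E0)) is the radius at which the Coulomb field equals E0.
    """
    a = math.sqrt(Q / (4.0 * math.pi * E0))
    r_max = r_max_factor * a
    dr = r_max / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        y = (Q / (4.0 * math.pi * r * r)) ** 2
        total += r * (1.0 - (1.0 + y / E0**2) ** -0.5) * dr
    return total

def force_numeric(q1, q2, E0):
    """Cone integral for the short-distance force, evaluated numerically."""
    return 2.0 * math.pi * cone_sin2(q1, q2) * E0**2 * radial_integral(q1 + q2, E0)

def force_closed(q1, q2, E0):
    """Closed form E0 q1 q2 / |q1 + q2|."""
    return E0 * q1 * q2 / abs(q1 + q2)

print(force_numeric(1.0, 3.0, 2.0), force_closed(1.0, 3.0, 2.0))
```

The two printed numbers should agree up to the small truncation error of the cutoff. Analytically, the substitution $t = r^2$ turns the radial integral into $\frac{1}{2}\int_0^\infty[1 - t/(t^2 + a^4)^{1/2}]\,dt = a^2/2$, which together with $\sin^2\theta = 4q_1q_2/(q_1 + q_2)^2$ gives the closed form.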

Following the same calculation in two dimensions gives

$F(q_1, q_2, \ell \to 0) \to \frac{E_0|q_1 + q_2|}{\pi}\sin\left(\pi\frac{q_1}{q_1 + q_2}\right)$. (105)

This is the same as the result found in [17], where two-dimensional BI electrostatics had been considered.

In $D$ dimensions, work in spherical coordinates around the charge axis. The volume element is then

$dV = r^{D-1}\sin^{D-2}\theta\,\sin^{D-3}\varphi_1 ... \sin\varphi_{D-3}\,dr\,d\theta\,d\varphi_1 ... d\varphi_{D-2}$ (106)

($0 \le \theta \le \pi$, $0 \le \varphi_k \le 2\pi$). The $(D-1)$-dimensional area on the cone of constant $\theta$ between $r$ and $r + dr$ is

$dA = \sin^{D-2}\theta\,\mathcal{I}_D\,r^{D-2}\,dr$, (107)

where $\mathcal{I}_D$ is the integral over the $\varphi_k$ in the expression for $dV$: $\mathcal{I}_D = \int\sin^{D-3}\varphi_1 ... \sin\varphi_{D-3}\,d\varphi_1 ... d\varphi_{D-2}$. This integral also appears in the expression for the $D$-dimensional solid angle, $\alpha_D = \mathcal{I}_D\int_0^\pi\sin^{D-2}\theta\,d\theta$. Now, the Coulomb field of a point charge $Q$ is $|\nabla\psi| = |Q|/\alpha_Dr^{D-1}$. Inserting in the expression for $u$ and integrating, as before, the ratio $\mathcal{I}_D/\alpha_D$ appears, and we find for the short-distance-limit force

$F \to \frac{E_0|Q|\sin^{D-1}\theta}{(D-1)\,\mathcal{I}^*_D}$, (108)
where $\mathcal{I}^*_D \equiv \int_0^\pi\sin^{D-2}\theta\,d\theta = \pi\Gamma(D-1)/2^{D-2}[\Gamma(D/2)]^2$. Finally, we express $\theta$, and hence the force, in terms of the charges. Again, from Gauss's law, the field lines ending at any of the charges subtend at infinity a solid angle in proportion to that charge. So, the opening angle $\theta$ is given by

$\int_0^\theta\sin^{D-2}\beta\,d\beta = \mathcal{I}^*_D\frac{q_1}{Q}$. (109)

To lowest order in $q_1/Q$, eq.(109) tells us that $\theta^{D-1} \approx (D-1)\mathcal{I}^*_Dq_1/Q$, and $F \approx E_0|q_1|$, as expected. For equal charges $q_1 = q_2 = q$, $\theta = \pi/2$ and

$F \to \frac{2^{D-1}[\Gamma(D/2)]^2}{\pi(D-1)\Gamma(D-1)}E_0|q|$. (110)

I have not been able to fully derive similar results for the PL theory. In this case, there is, generally, no common "watershed" surface for both potentials. If we use as integration surface the "watershed" of $\psi$, we get for the force

$\mathbf{F} = \int_\Sigma-(s + \nabla\phi\cdot\nabla\psi)\,d\boldsymbol{\sigma} + \nabla\psi(\nabla\phi\cdot d\boldsymbol{\sigma})$. (111)

As before, divide the integral into the contributions from radii below and above $\kappa\ell$. For $\kappa \gg 1$ the fields in the large-radius region become the fields for a point charge $Q$, and $\Sigma$ becomes a radial cone. Thus $\nabla\phi$ becomes perpendicular to $d\boldsymbol{\sigma}$, so the third term in the integrand contributes negligibly. Also, it is easy to ascertain that for spherical configurations $-(s + \nabla\phi\cdot\nabla\psi) = u$ (this expression then becomes minus the Legendre transform of $s$, which equals $u$). Thus, the contributions to the force integrals from the region $r > \kappa\ell$ are the same in the two theories for $\kappa \gg 1$. It is also clear that if we use in the integral the fields for the point charge $Q$, the contribution from the region $r < \kappa\ell$ vanishes in the limit $\ell \to 0$, for fixed $\kappa$. However, I have not been able to show that this is also the case for the small-radii contribution to the actual integral. In the BI case, the integrand $u$ is bounded; so this contribution vanishes at least as fast as $(\kappa\ell)^2$. If this can be shown to be the case for the PL theory, we would get the same expression for the force. But in the PL theory, the contribution from $r < \kappa\ell$ may be finite, in which case the forces differ in the two theories. This remains an open question.

Note, finally, that the above derivation for the BI case can be used to calculate the short-distance force for other 
configurations involving point charges. For example, consider $N$ charges, $q_i$, of equal sign, on a segment of the $x$ axis ($i$ increasing in the positive $x$ direction). The force on any $q_k$, in the limit of a shrinking configuration, can be calculated by subtracting the integrals over the two 'watershed' surfaces flanking this charge, giving

$$F_k \to \frac{E_0 q_k}{Q}\left(\sum_{i=1}^{k-1} q_i - \sum_{i=k+1}^{N} q_i\right) \tag{112}$$

($Q = \sum_i q_i$). (We cannot apply the same to a general point-charges configuration, because we do not know, in
general, the asymptotic shape of the 'watershed' for each charge.) 
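A short Python sketch can make the collinear result concrete. It assumes that eq. (112) reads $F_k \to (E_0 q_k/Q)\,(\sum_{i<k} q_i - \sum_{i>k} q_i)$, takes all charges positive for simplicity, and expresses forces in units of $E_0$ along the positive $x$ direction. The function name is hypothetical.

```python
def collinear_forces_over_E0(charges):
    """Short-distance force on each of N collinear positive charges, in units of E0.

    `charges` is ordered by increasing x. Uses the assumed form of eq. (112):
    F_k = q_k * (sum of charges to the left - sum of charges to the right) / Q.
    """
    Q = sum(charges)
    forces = []
    for k, qk in enumerate(charges):
        left = sum(charges[:k])        # charges at smaller x push q_k toward +x
        right = sum(charges[k + 1:])   # charges at larger x push q_k toward -x
        forces.append(qk * (left - right) / Q)
    return forces
```

Two consistency checks follow directly from this form: the forces on all charges sum to zero, as they must for internal forces, and for two equal charges the magnitude reduces to $E_0|q|/2$, which is the $D = 3$ value of eq. (110).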

Another example is the short-distance force on each charge in a system of N equal charges placed symmetrically on 
the vertices of a polygon, or a symmetric polytope (cube, tetrahedron). It is based on the fact that the 'watersheds' 
are symmetry surfaces, and so are known. For example, for $N$ equal charges at the vertices of a regular polygon, $\Sigma$ is made of two half-planes at an angle $2\pi/N$ to each other. So the integration would yield the same value as in the two-charge case, but now the planes make an angle $\pi/N$ with the charge axis. So the result has to be multiplied by $\sin(\pi/N)$. Also, in the expression for the asymptotic field we have to take $Nq$ as the total charge. This gives for the force:

$$F = \frac{1}{4}E_0 N|q|\sin\left(\frac{\pi}{N}\right). \tag{113}$$

For 8 charges $q$ on the vertices of a cube, we have $\Sigma$ made of three quarter-planes. We then get $F = (3^{1/2}/2)|q|E_0$.
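The symmetric cases can be reproduced by simple bookkeeping. The Python sketch below is my reading of the argument, not a derivation from the text: it assumes that in three dimensions the integral over a full 'watershed' plane for total charge $Q$ equals $E_0|Q|/4$, that each flat piece of $\Sigma$ carries a share of this value in proportion to its area, and that each share is projected onto the direction of the net force. Forces are in units of $E_0$, and the function names are hypothetical.

```python
import math

def full_plane_over_E0(Q):
    """Assumed 3D value of the surface integral over a full plane, in units of E0."""
    return abs(Q) / 4.0

def polygon_force_over_E0(N, q):
    """N equal charges on a regular polygon: two half-planes, projection sin(pi/N)."""
    per_half_plane = full_plane_over_E0(N * q) / 2.0
    return 2 * per_half_plane * math.sin(math.pi / N)

def cube_force_over_E0(q):
    """8 equal charges on a cube: three quarter-planes with mutually orthogonal normals.

    The net force points along the body diagonal, so each piece is projected
    with a factor 1/sqrt(3).
    """
    per_quarter_plane = full_plane_over_E0(8 * q) / 4.0
    return 3 * per_quarter_plane / math.sqrt(3)
```

Under these assumptions `polygon_force_over_E0(N, q)` equals $\frac{1}{4}N|q|\sin(\pi/N)$, which for $N = 2$ gives the two-charge value $|q|/2$, and `cube_force_over_E0(q)` equals $(3^{1/2}/2)|q|$.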